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(54) Title: MUTATIONS IN THE MYOSTATION GENE CAUSE DOUBLE-MUSCLING IN MAMMALS 



100 bp 




P94L 

ATTGATCAGTTCGATGTC 
I D Q F D V 
ATTGATCAGTTAGATGTC 
ID Q L D V 



nt419(del7-lnsl0) 
TGCTTCTTTAAATT... TACC 
C F P K P S 
TGCTTCTAAGCATACAATAGC 
CP* 



nt374-51{T-C) 
GTTTGTTnTCAT 
CTTCATTTTTCAT 
nt374-50(G-A) 




nt 7 49-76 (dol l) 
QATTAGTTTTCTTTCTTT 
GATTAGTTT_CTTTCTTT 



Q204X 




nt821(delll) 


GGTATTTGGCAGAGCATT 




TGTGATGAACACTCCACAGA 


G I W Q S I 




C D E H S T E 


GGTATTTGGTAGAGCATT 




TGTG APAfiA 


G I W * 




C R X(ll>* 



V 



nt374-16(d«ll) 
TTTTTTTCTTATTCA 
TTTTTT„CTTATTCA 



nt414(C-T) 




E226X 




C313Y 


CCCAAATGTTGCTTCTTT 




TTAGGCATTGAAATCAAA 




TCTGGAGAATGTGAATTT 


P K C C P F 




L O I B I K 




S G E C B P 


CCCAAATGTTGTTTCTTT 




TTAGGCATTTAAATCAAA 




TCTGGAGAATATGAATTT 


P K C C P F 




L G X • 




S G B Y E P 



(57) Abstract 

A gene (cDNA) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as SEQ ID NO: 1 and the protein 
sequence is identified as SEQ ID NO:2. A mutant gene (SEQ ID NO:3) in which the coding sequence lacks an 1 1-base pair consecutive 
sequence (SEQ ID NO: 11) of the sequence encoding bovine protein having myostatin activity has been sequenced. It has been shown that 
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are double-muscled. A method for determining 
the presence of muscular hyperplasia in a mammal is described. The method includes obtaining a sample of material containing DNA from 
the mammal and ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity of myostatin, is present, and 
whether a sequence of the DNA encoding (b) an alleUc protein lacking the activity of (a), is present. The absence of (a) and the presence 
of (b) indicates the presence of muscular hyperplasia in the mammal. 
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MUTATIONS IN THE MYOSTATIN GENE CAUSE DOUBLE-MUSCLING IN MAMMALS 
Field f The Inventi n 

This invention relates to factors affecting muscle development in mammals, 

especially livestock. In particular, this invention relates to the cloning of the myostatin gene, a 

5 member of the TGF-p superfamily, its involvement in muscular hyperplasia in livestock, and a 

method for determining myostatin genotypes. 



Description Of Related Art 

The TGF-p superfamily consists of a group of multifunctional polypeptides which 
control a wide range of differentiation processes in many mammalian cell types. GDF-8 is a 
0 member of the TGF-p superfamily. All members of this superfamily share a common structur 
including a short peptide signal for secretion and an N-terminal peptide fragment that is 
separated from the bioactive carboxy-terminal fragment by proteolytic cleavage at a highly 
conserved proteolytic cleavage site. The bioactive carboxy-terminal domain is characterized by 
cysteine residues at highly conserved positions which are involved in intra- and intermolecular 
5 disulfide bridges. The functional molecules are covalently linked (via a S-S bond) dimers of the 
carboxy-terminal domain (Masterson etaL % 1996). 

Recently, it was reported that mice deficient in the gene coding for GDF-8 were 
characterized by a generalized muscular hyperplasia (McPherron etal., 1997). The GDF-8 
deficient mice were produced by gene targeting using homologous recombination in embryonic 
stem cells, a method referred to as 'gene knock-out". The murine generalized muscular 
hyperplasia appeared to be very similar in its expression to the muscular hyperplasia 
characterizing "double-muscled" cattle. This observation raised the intriguing possibility that the 
bovine gene coding for GDF-8 (i.e. the bovine evolutionary homologue of the mouse GDF-8 
gene) is involved in the bovine double-muscling phenotype. It also raised the possibility that the 
human gene coding for GDF-8 (i.e. the human evolutionary homologue of the mouse GDF-8 
gene) is involved in regulating muscular development in humans, specifically skeletal muscle 
genesis. Isolation of the human GDF-8 gene may have therapeutic uses/applications in the 
treatment of musculodegenerative diseases through upgrading or downgrading the expression of 
GDF-8. 

The occurrence of animals characterized by a distinct generalized muscular 
hypertrophy, commonly known as "double-muscled" animals, has been reported in several cattle 
breeds around the world. The first documented description of double-muscled cattle dates back 
as early as 1807 (Culley, 1807). One of the breeds in which this characteristic has been most 
thoroughly analyzed is the Belgian Blue Cattle Breed ("Belgian Blue Breed"). This is one of the 
only breeds where the double-muscled trait has been systematically selected for, and where the 
double-muscled phenotype is virtually fixed. A comparison of double-muscled and conventional 
animals within the Belgian Blue Breed, showed an increase in muscle mass by 20% on average, 
while all other organs were reduced in size (Hanset, 1986 and 1991). The muscular hypertrophy 
was shown to be an histological hyperplasia affecting primarily superficial muscles, 
accompanied by a 50% reduction in total lipid content and a reduction in connective tissue 
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fraction as measured by hydroxyproline content (Hanset et aL 9 1982). Double-muscled animals 
were shown to have a reduced feed intake with improved feed conversion ratio (Hanset etai, 
1987). An important economic benefit of double-muscled animals, in contrast to conventional 
animals, is the substantial increase in selling price and net income for the farmer (Hanset et a/., 
5 1987). 

One of the most thorough series of studies on double-muscling is that of Hanset 
and colleagues in the Belgian Blue Breed. Objective criteria of muscular development, such as 
dressing-out percentage, lean and fat percentage, plasma and red cell creatine and creatinine 
concentrations, were measured on nearly 150 randomly selected animals raised in standardized 

1 0 conditions. These studies clearly revealed abnormal, bimodal distributions of the double- 
muscled phenotype and objectively confirmed the visual classification traditionally performed by 
breeders on double-muscled and conventional animals. The phenotypic distribution was 
resolved using a maximum likelihood procedure into two component normal populations with a 
common variance which revealed mean differences of three to four standard deviations 

15 depending on the trait. This suggested the presence of an allele having a major effect on 

muscular development with a population frequency close to 50% (Hanset and Michaux, 1985b). 
The most convincing evidence in favour of such an allele, however, came from experimental 
crosses involving double-muscled Belgian Blue sires and Holstein Friesian dairy cows (the latter 
animals having very poor muscular development). While F1 offspring showed a phenotypic 

20 distribution very similar to their Holstein Friesian dams, backcrossing these F1 's to double- 
muscled sires produced a bimodal BC generation, clearly pointing towards the Mendelian 
segregation of a recessive u mh" (muscular hypertrophy) allele (Hanset and Michaux., 1985a), 

The same kind of experimental crosses were subsequently used to perform a 
whole genome scan using a microsatellite based marker map. To perform the linkage analysis, 

25 animals were classified as double-muscled or conventional. Very significant Logarithm of the 
Odds scores (lodscores) were obtained on chromosome 2 (> 17), and multi point linkage 
analysis positioned the mh locus at the centromeric end of this chromosome, at [2]centimorgan 
from the nearest microsatellite marker: TGLA44. The corresponding chromosomal region 
accounted for all the variance of the trait assumed to be fully penetrant in this experiment 

30 (Charlierefa/.,1995). 

In humans, genes coding for some forms of muscular abnormalities have been 
isolated, e.g. muscular dystrophy. The present invention provides for the gene which regulates 
the development of skeletal muscle only, as opposed to other types of muscle, e.g. smooth or 
cardiac muscle. The present invention may provide an understanding of the role of the GDF-8 

35 gene or its receptor in the regrowth of skeletal muscle in humans which only undergo a 
hyperplasic response. 
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Summary f the Invent! n 

The present inventors have identified and sequenced a gene (cDNA and 
genomic) encoding a bovine myostatin protein. The nucleic acid coding sequence is identified as 
SEQ ID NO:1 and the protein sequence is identified as SEQ ID NO:2. The genomic bovine 
5 sequence is identified as SEQ ID NO:54. A mutant gene (SEQ ID NO:3) in which the coding 
sequence lacks an 1 1-base pair consecutive sequence (SEQ ID NO:1 1) of the sequence 
encoding bovine protein having myostatin activity has been sequenced. It has been shown that 
cattle of the Belgian Blue breed homozygous for the mutant gene lacking myostatin activity are 
double-muscled. Other bovine mutations which lead to double-muscling in have also been 
10 determined, being identified herein as nt419(del7-ins10), Q204X, E226Xand C313Y, 
respectively. 

In one aspect, the present invention thus provides a method for determining the 
presence of muscular hyperplasia in a mammal. The method includes obtaining a sample of 
material containing DNA from the mammal and ascertaining whether a sequence of the DNA 
15 encoding (a) a protein having biological activity of myostatin, is present, and whether a sequence 
of the DNA encoding (b) an allelic protein lacking the activity of (a), is present. The absence of 
(a) and the presence of (b) indicates the presence of muscular hyperplasia in the mammal. 

Of course, the mutation responsible for the lack of activity can be a naturally 
occurring mutation, as is the case for the Belgian Blue, Asturiana, Parthenaise or Rubia Gallega 
20 breeds, shown here. 

The mammal can be a human, bovine, etc. 

There are several methods known for determining whether a particular 
nucleotide sequence is present in a sample. A common method is the polymerase chain 
reaction. A preferred aspect of the invention thus includes a step in which ascertaining whether a 

25 sequence of the DNA encoding (a) is present, and whether a sequence of the DNA encoding (b) 
is present includes amplifying the DNA in the presence of primers based on a nucleotide 
sequence encoding a protein having biological activity of myostatin. 

A primer of the present invention, used in PCR for example, is a nucleic acid 
molecule sufficiently complementary to the sequence on which it is based and of sufficient length 

30 to selectively hybridize to the corresponding portion of a nucleic acid molecule intended to be 
amplified and to prime synthesis thereof under in vitro conditions commonly used in PCR. 
Likewise, a probe of the present invention, is a molecule, for example a nucleic acid molecule of 
sufficient length and sufficiently complementary to the nucleic acid molecule of interest, which 
selectively binds under high or low stringency conditions with the nucleic acid sequence of 

35 interest for detection thereof in the presence of nucleic acid molecules having differing 
sequences. 

In preferred aspects, primers are based on the sequence identified as SEQ ID 
NO:7 (human cDNA sequence) or SEQ ID NO:54. 
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ln another aspect, the invention is a method for determining the presence of 
muscular hyperplasia in a mammal which includes obtaining a sample of material containing 
mRNA from the mammal. Such method includes ascertaining whether a sequence of the 
mRNA encoding (A) a protein having biological activity of myostatin, is present, and whether a 
5 sequence of the mRNA encoding (B) a protein at least partially encoded by a truncated 

nucleotide sequence corresponding to substantially the sequence of the mRNA and lacking the 
activity of (A), is present. The absence of (A) and the presence of (B) indicates the presence of 
muscular hyperplasia in the mammal. 

The mRNA encoding (A) and the truncated sequence can correspond to alleles 

10 of DNA of the mammal. 

Again, if an amplification method such as PCR is used in ascertaining whether a 
sequence of the mRNA encoding (A) is present, and whether a sequence of the mRNA encoding 
(B) is present, the method includes amplifying the mRNA in the presence of a pair of primers 
complementary to a nucleotide sequence encoding a protein having biological activity of 

15 myostatin. Each such primer can contain a nucleotide sequence substantially complementary, 
for example, to the sequence identified as SEQ ID NO:7. The truncated sequence can contain at 
least 50 consecutive nucleotides substantially corresponding to 50 consecutive nucleotides of 
SEQ ID NO:7, for example. 

In another aspect, the invention is a method for determining the presence of 

20 muscular hyperplasia in a mammal which includes obtaining a tissue sample of containing 
mRNA of the mammal and ascertaining whether an mRNA encoding a mutant type myostatin 
protein lacking biological activity of myostatin is present. The presence of such an mRNA 
encoding a mutant type myostatin protein indicates the presence of muscular hyperplasia in the 
mammal. 

25 In another aspect, the invention thus provides a method for determining the 

presence of muscular hyperplasia in a bovine animal. The method includes obtaining a sample 
of material containing DNA from the animal and ascertaining whether DNA having a nucleotide 
sequence encoding a protein having biological activity of myostatin is present. The absence of 
DNA having such a nucleotide sequence indicates the presence of muscular hyperplasia in the 

30 animal. Ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin can include amplifying the DNA in the presence of primers based 
on a nucleotide sequence encoding a protein having biological activity of myostatin. 

In particular, the method can be carried out using a sample from an animal in 
which such a bovine animal not displaying muscular hyperplasia is known to have a nucleotide 

35 sequence which is capable of hybridizing with a nucleic acid molecule having the sequence 
identified as SEQ ID NO:1 under stringent hybridization conditions. 

It is possible that ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
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in the presence of primers based on a nucleotide sequence encoding the N-teiminai and th C- 
terminal, respectively, of the protein having biological activity of myostatin. 

Primers, say first and second primers, can be based on first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein the regions flank a 
5 mutation known to naturally occur and which when present in both alleles of a such an animal 
results in muscular hyperplasia. 

It can also be that DNA of such an animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 

10 sequence of DNA of a such an animal displaying muscular hyperplasia is known to contain an 
1 1-base pair deletion beginning at base pair no. 821 of the coding sequence, and said first primer 
is selected to be upstream of the codon encoding glutamic acid no. 275 and the second primer 
is selected to be downstream of the codon encoding aspartic acid no. 274. 

Also, a DNA of such an animal not displaying muscular hyperplasia might 

15 contain a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2. The coding 
sequence of DNA of such an animal displaying muscular hyperplasia might be known to contain 
an 1 1-base pair deletion beginning at base pair no. 821 . A primer can be selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing the 

20 deletion. 

The animal can be of the Belgian Blue breed. 

In a particular aspect, ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of a primer containing at least a portion of a mutation known to naturally occur 
25 and which when present in both alleles of a said animal results in muscular hyperplasia. 

In another aspect, the invention is a method for determining the presence of 
muscular hyperplasia in a bovine animal which includes obtaining a sample of the animal 
containing mRNA and ascertaining whether an mRNA encoding a protein having biological 
activity of myostatin is present in the sample. The absence of said mRNA indicates the presence 
30 of muscular hyperplasia in the animal. 

A sample containing mRNA can be muscle tissue, particularly, skeletal muscle 

tissue. 

In a particular aspect, the invention is a method for determining the presence of 
double muscling in a bovine animal, involving obtaining a sample of material containing DNA 
35 from the animal and ascertaining whether the DNA contains the nucleotide sequence identified 
as SEQ ID NO:11 in which the absence of the sequence indicates double muscling in the animal. 

In a particular aspect, the animal is of the Belgian Blue breed. 

In another aspect, the invention is a method for determining the myostatin 
genotype of a mammal, as may be desirable to know for breeding purposes. The method 
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includes obtaining a sample of material containing nucleic acid of the mammal, wherein the 
nucleic acid is uncontaminated by heterologous nucleic acid; ascertaining whether the sample 
contains a (i) nucleic acid molecule encoding a protein having biological activity of myostatin; and 
ascertaining whether the sample contains an 00 allelic nucleic acid molecule encoding a protein 
5 lacking biological activity of myostatin. The mammal can be bovine. 

In another aspect the subject is human and (i) includes a nucleic acid sequence 
substantially homologous On the sense of identity) with the sequence identified as SEQ ID NO:7. 

The invention includes a method of increasing muscle mass of a mammal 
having muscle cells in which myostatin is expressed, the method comprising administering to the 
10 mammal an effective amount of a nucleic acid molecule substantially complementary to at least 
a portion of mRNA encoding the myostatin and being of sufficient length to sufficiently reduce 
expression of the myostatin to increase the muscle mass. In a particular aspect, the mammal is 
bovine. 

In another embodiment, the invention is a method of increasing muscle mass of 
1 5 a mammal, including administering to the mammal an effective amount of a nucleic acid 

molecule having ribozyme activity and a nucleotide sequence substantially complementary to at 
least a portion of mRNA encoding myostatin and being of sufficient length to bind selectively 
thereto to sufficiently reduce expression of the myostatin so as to increase the muscle mass. 

The invention includes a diagnostic kit, for determining the presence of muscular 
20 hyperplasia in a mammal from which a sample containing DNA of the mammal has been 
obtained. The kit includes first and second primers for amplifying the DNA, the primers being 
complementary to nucleotide sequences of the DNA upstream and down stream, respectively, of 
a mutation in the portion of the DNA encoding myostatin which results in muscular hyperplasia of 
the mammal, wherein at least one of the nucleotide sequences is selected to be from a non- 
25 coding region of the myostatin gene. The kit can also includes a third primer complementary to a 
naturally occurring mutation of a coding portion of the myostatin gene. 

A particular diagnostic kit, for determining the genotype of a sample of 
mammalian genetic material, particularly bovine material includes a pair of primers for amplifying 
a portion of the genetic material corresponding to a nucleotide sequence which encodes at least 
30 a portion of a myostatin protein, wherein a first of the primers includes a nucleotide sequence 
sufficiently complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic acid 
molecule containing the mutation, the mutation being selected from the group of mutations 
resulting from: (a) deletion of 11 nucleotides beginning at nucleotide 821 of the coding portion of 
SEQ ID NO:1 ; (b) deletion of 7 nucleotides beginning at nucleotide 419 of the coding sequence 
35 and insertion of the sequence AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of 
the coding sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 
coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 313 of the 
coding sequence and insertion of A in place thereof; and combinations th reof. The second of 
the pair of primers is preferably located entirely upstream or entirely downstr am of the selected 
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mutation or mutations. In on kit, a first said primer spans mutation (a) and further comprising a 
third primer which is sufficiently complementary to the nucleotide sequence identified as SEQ ID 
NO:1 1 to prime amplification of a nucleic acid molecule containing SEQ ID NO:1 1 . In another (or 
the same kit), a first said primer is sufficiently complementary to the inserted sequence of 
5 mutation (b) to prime amplification of a nucleic acid molecule containing mutation (b) and further 
comprising a third primer which is sufficiently complementary to the sequence corresponding to 
the 7 nucleotide deletion of mutation (b) to prime amplification of a nucleic acid molecul 
containing the 7 nucleotide deletion of mutation (b). In another (or the same kit), a first said 
primer spans mutation (c) and further comprising a third primer which is sufficiently 

10 complementary to the sequence spanning the corresponding region lacking mutation (c) to prime 
amplification of a nucleic acid molecule lacking mutation (c). In another (or the same kit), a first 
said primer spans mutation (d) and further comprising a third primer which is sufficiently 
complementary to the sequence spanning the corresponding region lacking mutation (d) to 
prime amplification of a nucleic acid molecule lacking mutation (d). In another (or the same kit), 

15 a first said primer spans mutation (e) and further comprising a third primer which is sufficiently 
complementary to the sequence spanning the corresponding region lacking mutation (e) to 
prime amplification of a nucleic acid molecule lacking mutation (e). 

The invention includes a purified protein having biological activity of myostatin, 
and having an amino acid sequence identified as SEQ ID NO:2, or a conservatively substituted 

20 variant thereof. The invention includes a purified bovine protein having biological activity of 
myostatin or a purified human protein (SEQ ID NO:8) having biological activity of myostatin. 

The invention includes an isolated nucleic acid molecule encoding a foregoing 
protein. Particularly, the invention includes an isolated nucleic acid molecule comprising a DNA 
molecule having the nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:3 or SEQ ID 

25 NO:7 or which varies from the sequence due to the degeneracy of the genetic code, or a nucleic 
acid strand capable of hybridizing with at least one said nucleic acid molecule under stringent 
hybridization conditions. 

The invention includes isolated mRNA transcribed from DNA having a sequence 
which corresponds to a nucleic acid molecule of the invention. 

30 The invention includes isolated DNA in a recombinant cloning vector and a 

microbial cell containing and expressing heterologous DNA of the invention. 

The invention includes a transfected cell line which expresses a protein of the 

invention. 

■ 

The invention includes a process for producing a protein of the invention, 
35 including preparing a DNA fragment including a nucleotide sequence which encodes the protein; 
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA molecule 
which includes the DNA fragment and is capable of undergoing replication; 
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transforming a host cell with the recombinant DNA molecule to produce a transformant which 
can express the protein; culturing the transformant to produce the protein; and recovering the 
protein from resulting cultured mixture. 

The invention includes a method of inhibiting myostatin so as to induce 

5 increased muscle mass in a mammal, comprising administering an effective amount of an 
antibody to myostatin to the mammal. 

The invention includes a method of increasing muscle mass in a mammal, by 
raising an autoantibody to the myostatin the in the mammal. Raising the autoantibody can 
include administering a protein having myostatin activity to the mammal. 

10 The invention includes a method of increasing muscle mass in a mammal 

including administering to the mammal an effective amount of an antisense nucleic acid or 
oligonucleotide substantially complementary to at least a portion of the sequence identified as 
SEQ ID NO:1 or SEQ ID NO:5, or SEQ ID NO:7. The portion can be at least 5 nucleotide bases 
in length or longer. The mammal can be a bovine and the sequence can be that identified as 

15 SEQIDNO:1. 

The invention includes a method of inhibiting production of myostatin in a 
mammal in need thereof, including administering to the mammal an effective amount of an 
antibody to the myostatin. 

The invention includes a probe containing a nucleic acid molecule sufficiently 

20 complementary with a sequence identified as SEQ ID NO:1 , or its complement, so as to bind 
thereto under stringent conditions. The probe can be a sequence which is between about 8 and 
about 1 195 nucleotides in length. 

The invention includes a primer composition useful for detection of the presence 
of DNA encoding myostatin in cattle. The composition can include a nucleic acid primer 

25 substantially complementary to a nucleic acid sequence encoding a bovine myostatin. The 
nucleic acid sequence can be that identified as SEQ ID NO:1. 

The invention includes a method for identifying a nucleotide sequence of a 
mutant gene encoding a myostatin protein of a mammal displaying muscular hyperplasia. The 
method includes obtaining a sample of material containing DNA from the mammal and probing 

30 the sample using a nucleic acid probe based on a nucleotide sequence of a known gene 

encoding myostatin in order to identify nucleotide sequence of the mutant gene. In a particular 
approach, the probe is based on a nucleotide sequence identified as SEQ ID NO:1 , SEQ ID NO:5 
or SEQ ID NO:7. Preferably, the probe is at least 8 nucleic acids in length. The step of probing 
the sample can include exposing the DNA to the probe under hybridizing conditions and further 

35 comprising isolating hybridized nucleic acid molecules. The method can further include the step 
of sequencing isolated DNA. The method can include the step of isolating and sequencing a 
cDNA or mRNA encoding the complete mutant myostatin protein. The method can include a 
step of isolating and sequencing a functional wild type myostatin from the mammal not displaying 
muscular hyperplasia. 
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The method can include comparing the complete coding sequence of the 
complete mutant myostatin protein with, if the coding sequence for a functional wild type 
myostatin from such a mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from such a mammal is previously unknown, (2) 
5 the sequence determined according to claim 63 or claim 66, to determine the location of any 
mutation in the mutant gene. 

The invention includes a primer composition useful for the detection of a 
nucleotide sequence encoding a myostatin containing a first nucleic acid molecule based on a 
nucleotide sequence located upstream of a mutation determined according to a method of the 
10 invention and a second nucleic acid molecule based on a nucleotide sequence located 
downstream of the mutation. 

A probe of the invention can include a nucleic acid molecule based on a 
nucleotide sequence spanning a mutation determined according to the invention. 

The invention includes an antibody to a protein encoded by a nucleotide 
15 sequence identified as SEQ ID NO:1 , SEQ ID NO:3 or SEQ ID NO:7, or other protein of the 
present invention. 

The invention includes a transgenic bovine having a genome lacking a gene 
encoding a protein having biological activity of myostatin; a transgenic mouse having a genome 
containing a gene encoding a human protein having biological activity of myostatin or containing 

20 a gene encoding a bovine protein having biological activity of myostatin; a transgenic bovine 

having a gene encoding a bovine protein having biological activity of myostatin and heterologous 
nucleotide sequence antisense to the gene. The transgenic bovine can include a gene encoding 
a nucleic acid sequence having ribozyme activity and in transcriptional association with the 
nucleotide sequence antisense to the gene. 

25 The invention includes a transgenic mammal, usually non-human, having a 

phenotype characterized by muscular hyperplasia, said phenotype being conferred by a 
transgene contained in the somatic and germ cells of the mammal, the transgene encoding a 
myostatin protein having a dominant negative mutation. The transgenic mammal can be male 
and the transgene can be located on the Y chromosome. The mammal can be bovine and the 

30 transgene can be located to be under the control of a promoter which normally a promoter of a 
myosin gene. 

Another transgenic mammal, usually non-human, of the invention has a 
phenotype characterized by muscular hyperplasia, in which the phenotype is conferred by a 
transgene having a sequence antisense to that encoding a myostatin protein of the mammal. 
35 The mammal can be a male bovine and the transgene can be located on the Y chromosome. 
The transgene can further include a sequence which when transcribed obtains an mRNA having 
ribozyme activity. 

A transgenic non-human mammal of the invention having a phenotype 
characterized by muscular hyperplasia, can have the phenotype inducible and conferred by a 
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myostatin gene flanked by J oxP sides and a Cre transgene under the dependence of an 
inducible promoter. 

A transgenic non-human male mammal of the invention having a phenotype 
characterized by muscular hyperplasia, can have the phenotype conferred by a myostatin gene 
5 flanked by J oxP sides and a Cre transgene located on the Y chromosome. 

The invention includes a method for determining whether a sample of 
mammalian genetic material is capable of a conferring a phenotype characterized by muscular 
hyperplasia, comprising ascertaining whether the genetic material contains a nucleotide 
sequence encoding a protein having biological activity of myostatin, wherein the absence of said 
1 0 sequence indicates the presence of muscular hyperplasia in the animal. 

Brief Description Of Drawings 

In describing particular aspects of the invention, reference is made to the 

accompanying drawings, in which: 

Figure 1 is a schematic summary of genetic, physical and comparative mapping 
1 5 information around the bovine mh locus. A multi-point lodscore curve obtained for the mh locus 

with respect to the microsatellite marker map is shown. Markers that were not informative in the 

pedigree used are shown between brackets; their map position is inferred from published 

mapping data. Markers and the YACs from which they were isolated are connected by arrows. 

The RH-map of the relevant section of human HSA2 is shown, with the relative position in cR of 
20 the ESTs used. Stippled lines connect microsatellite and Type I markers with their respective 

positive YACs. YACs showing cross-hybridizing SINE-PCR products are connected by the red 

boxes. 

Figure 2(a) shows electropherograms obtained by cycle-sequencing the 

myostatin cDNA sequence from a double-muscled and a conventional animal, showing the 
25 nt821del(11) deletion (SEQ ID NO:11) in the double-muscied animal. The primers used to 

amplify the fragment encompassing the deletion from genomic DNA are spaced apart from the 

remaining nucleotides. 

Figure 2(b) shows the amino-acid sequence of the murine (top row), bovine 

normal (middle row) and bovine nt821del(11) (bottom row) allele. The putative site of proteolytic 
30 processing is boxed, while the nine conserved cysteines in the carboxy-terminal region are 

underlined. The differences between the normal and nt821del(1 1) bovine allele are indicated by 

the double underlining. 

Figure 3 is a schematic representation of the bovine myostatin gene with position 

and definition of the identified DNA sequence polymorphisms. The "A" (clear) boxes correspond 
35 to the untranslated leader and trailer sequences (large diameter), and the intronic sequences 

(small diameter) respectively. The tt B", U C", and "D" boxes correspond to the sequences coding 

for the leader peptide, N-terminal latency-associated peptide and bioactive carboxyterminal 

domain of the protein respectively. Small "e n , T and "g" arrows point towards the positions of the 
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primers used for intron amplification, exon amplification and sequencing and xon sequencing 
respectively; the corresponding primer sequences are reported in Table 1 . The positions of the 
identified DNA sequence polymorphisms are shown as "h", T or "j" lines on the myostatin gene 
for silent, conservative and disrupting mutations respectively. Each mutation is connected via an 
5 arrow with a box reporting the details of the corresponding DNA sequence and eventually 
encoded peptide sequence. In each box, the variant sequence is compared with the control 
Holstein-Friesian sequence and differences are highlighted in color. 

Figure 4 shows the distribution of identified mutations in the various breeds 
examined. The order of the myostatin mutations correspond to Figure 3. All analyzed animals 
1 0 were double-muscled except for the two Holstein-Friesian and two Jerseys used as controls 
(column 1). 

Detailed Description Of Preferred Embodiments 

The method used for isolating genes which cause specific phenotypes is known 
as positional candidate cloning. It involves: (i) the chromosomal localization of the gene which 

15 causes the specific phenotype using genetic markers in a linkage analysis; and (ii) the 

identification of the gene which causes the specific phenotype amongst the "candidate" genes 
known to be located in the corresponding region. Most of the time these candidate genes are 
selected from available mapping information in humans and mice. 

The tools required to perform the initial localization (step (i) above) are 

20 microsateflite marker maps, which are available for livestock species and are found in the public 
domain (Bishop et a/., 1994; Barendse et a/., 1994; Georges e/a/., 1995; and Kappes, 1997). 
The tools required for the positional candidate cloning, particularly the YAC libraries, (step 00 
above) are partially available from the public domain. Genomic libraries with large inserts 
constructed with Yeast Artificial Chromosomes ("YAC") are available in the public domain for 

25 most livestock species including cattle. When cross-referencing the human and mice map, it is 
necessary to identify the positional candidate, which is available at low resolution but needs to be 
refined in every specific instance to obtain the appropriate level of high resolution. A number of 
original strategies are described herein to achieve this latter objective. For general principles of 
positional candidate cloning, see Collins, 1995 and Georges and Andersson, 1996. 

30 In order to allow for cross-referencing between the bovine and human gene 

map as part of the positional candidate cloning approach, HSA2q31-32 (map of the long arm of 
human chromosome 2, cytogenetic bands q31-32) and BTA2q12-22 (map of the arm of bovine 
chromosome 2, cytogenetic bands q12-22) were integrated on the basis of coincidence bovine 
YAC's as described below. 

35 Using a previously described experimental [(normal x double-muscled) x double- 

muscled] backcross population comprising 108 backcross individuals, the mh locus was recently 
mapped by linkage analysis to the centromeric tip of bovine chromosome 2 (BTA2), at 3.1 
centiMorgan proximal from the last marker on the linkage map: TGLA44 (Charlier et a/., 1995). It 
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was also known from previous work that pro-a(lll) collagen (Col3AI) was located in the same 
chromosomal region as the mh locus. Col3AI has been mapped to BTA2q12-22 by in situ 
hybridization (Solinas-Toldo et a/., 1995), while a Col3AI RFLP marker was shown to be closely 
linked to TGLA44 (8=2%)(Fisher et a/., 1 996). This identifies the region flanking Col3AI on the 
5 human map, i.e. HSA2q31-32, as the likely orthologous human chromosome segment. This 
assumption is compatible with data from Zoo-FISH experiments (Solinas-Toldo etal. t 1995) as 
well as mapping data of Type I markers on somatic cell hybrids (O'Brien etal. t 1993), which 
establish an evolutionary correspondence between segments of HSA2q and BTA2. 

In order to refine the correspondence between the HSA2q31-33 and BTA2q12- 

10 22 maps, Comparative Anchored Tagged Sequences or CATS, i.e. primer pairs that would 

amplify a Sequence Tagged Site or STS from the orthologous gene in different species (Lyons et 
a/. f 1996), were developed for a series of genes flanking Col3A1 on the human map and for 
which sequence information was available in more than one mammal. In addition to Col3AI, 
working CATS were obtained for a2(V) collagen (Col5A2), inositol polyphosphate-1 phosphatase 

1 5 (INPP1), tissue factor pathway inhibitor precursor (TFP/J, titin (TTA/), n-chimaerin (CHA/), 

glutamate decarboxylase 67 (GAD1), Cytotoxic T-iymphocyte-associated protein 4 (CTLA4) and 
T-cell membrane glycoprotein CD28 (CD28). The corresponding primer sequences are given in 
Table 1 . 
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Tablel: 





CATS 




INPP1 


UP: 5 CAGCAAAGTCCTTAATGGTAACAAGC 3 1 


DN: 5" GGGTCACTGAAGAAAACGTCCTG 3' 




COL3A1 


UP: 5* CCCCATATTATGGAGATGAACCG 3' 


DN: S AGTTCAGGATGGCAGAATTTCAG 3' 


5 


COL5A2 


UP: 5* GCAAACTGGGYGGRAGCAAGACC 3' 


DN: 5* TTSTTC CTGGGC 1 1 1 1 ATTGAGAC 3' 




TFPI 


UP: 5' AAGCCWGATTTCTGCTTYTTGGAAG 3' 


DN: S TGCCMAGGCAHCCRCCRTACTTGAA 3' 




TTN 


UP: 5* GGTCGTCCTACACCAGAAG 3 f 


DN: 5* GGTTGACATTGTCAAGAACAAG 3' 




CHN 


UP: 5" TCTCMAAAGTCGTCTGTGACAATC 3' 


DN: 5* TGYTCRTTTTCTTTCAGAGTTGC 3* 




GAD1 


UP: 5' RCTGGTCCTCTTCACCTCAGAAC 3 1 


DN: 5* ACATTGTCVGTTCCAAAGCCAAG 3' 


10 


CTLA4 


UP: 5' AGGTYCGGGTGACDGTGCTKC 3* 


DN: 5* TGGRTACATGAGYTCCACCTTGC 3* 




CD28 


UP: 5' AGCTGCARGTATWCCTACAAYCT 3' 


DN: 5* GTYCCRTTGCTCYTCTCRTTGYC 3' 




Microsatellite markers 




TGLA44 


UP:5" AACTGTATATTGAGAGCCTACCATG 3* 


DN: 5* CACACCTTAGCGACTAAACCACCA 3* 




BULGE27 


UP: 5* CTACCTAACAGAATGATTTTGTAAG 3' 


DN: 5* AGTGTTCTTGCCTAGAGAATCCCAG 3' 


15 


BULGE23 


UP: S ACATTCTCTCACCAATATGACATAC 3* 


DN: 5' TAAGTCACCATTACATCCTTAGAAC 3 




BM81124 


UP: 5 GCTGTAAGAATCTTCATTAAGCACT 3' 


DN: 5* CCTGATACATGCTAAGGTTAAAAAC 3" 




BULGE28 


UP: 5' AGGCATACATCTGGAGAGAAACATG 3* 


DN: 5* CAGAGGAGCCTAGCAGGCTACCGTC 3' 




BULGE20 


UP: 5' CAGCAGGTCTGTTGAAGTGTATCAG 3' 


DN: 5' AGTGGTAGCATTCACAGGTAGCCAG 3' 




BM3627 


UP: 5' CAGTCCATGGCACCATAAAG 3' 


DN: 5' TCCGTTAGTACTGGCTAATTGC 3" 


20 


1LSTS026 


UP: 5' CTGAATTGGCTCCAAAGGCC 3' 


DN: 5' AAACAGAAGTCC AG GGCTGC 3' 




INRA40 


UP: 5' TCAGTCTCCAGGAGAGAAAAC 3' 


DN: 5' CTCTGCCCTGGGGATGATTG 3' 




Bovine Myostatin primers 




GDF8.19 


5' AATGTATGTTTATATTTACCTGTTCATG 3' 






GDF8.11 


5' ACAGTGTTTGTGCAAATCCTGAGAC 3' 




25 


GDF8.12 


5' CAATGCCTAAGTTGGATTCAGGTTG 3* 






GDF8.25 


5' CTTGCTGTAACCTTCCCAGGACCAG 3' 






GDF8.15 


5' TCCCATCCAAAGGCTTCAAAATC 3' 






GDFB.21 


5' ATACTCWAGGCCTAYAGCCTGTGGT 3' 





Reading from left to right and down the table, the sequences given in Table 1 are Identified as 
30 SEQ ID NO:12 to SEQ ID NO:53, respectively. 



These CATS were used to screen a 6-genome equivalent bovine YAC library by 
PCR using a three-dimensional pooling strategy as described by Libert etai, 1993. The same 
YAC library was also screened with all microsatellite markers available for proximal BTA2, i.e. 
TGLA44, BM81124, BM3627, ILSTS026, INRA40 and TGLA431 (Kappes et a/., 1997). 

35 Potential overlap between the YACs obtained with this panel of STS's was 

explored on the basis of common STS content, as well as cross-hybridization between SINE- 
PCR product from individual YACs. From this analysis, three independent YAC contigs emerged 
in the region of interest: (i) contig A containing microsatellites TGLA44, BM81 124 and Type I 
marker INPP1\ (ii) contig B containing Col3AI and Col5A2\ and (iii) contig C containing 

40 microsatellite markers BM3627, ILSTS026 and INRA40, and Type I marker TFPI. 

None of the available microsatellites mapped to contig B, therefore this cluster 
of YACs could not be positioned in cattle with respect to the two other contigs. Available 
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mapping information in the human, however, allowed prediction of contig B's position between 
contigs A and C. To test this hypothesis, two new microsatellite markers were isolated from 
contig B, BULGE20 and BULGE28. BULGE20 proved to be polymorphic, allowing for 
genotyping of the experimental backcross population. 
5 In addition, to increase the informativeness of the markers available for contig A, 

two new microsatellite markers were developed from this contig: BULGE23 and BULGE27. 
BULGE23 proved to be polymorphic and was used to type the same pedigree material. 

All resulting genotypes were used to construct a linkage map using the ILINK 
program (Lathrop and Lalouel, 1984). The following most likely order and sex-averaged 
1 0 recombination rates between adjacent markers was obtained: {TGLA44-(0%)-BULG23]-(6,1 % )- 
BULG20-(1,6%)-ILSTS026-(2.3%)-INRA40-(7,1%)-TGLA431. The position of BULGE20 between 
TGLA44 and 1LSTS026 confirmed the anticipated order of the three contigs. Figure 1 
summarizes the resulting mapping information. 

A multi point linkage analysis was undertaken using LINKMAP, to position the 
15 mh locus with respect to the new marker map. Linkage analysis was performed under a simple 
recessive model, assuming full penetrance for mh/mh individuals and zero penetrance for the 
two other genotypes. The LOD score curve shown in Figure 1 was obtained, placing the mh locus 
in the TGLA44-BULGE20 interval with an associated maximum LOD score of 26.4. Three 
backcross individuals were shown to recombine with the BULGE20 and distal markers, but not 
20 with TGLA44 and BULGE23, therefore placing the mh locus proximal from this marker. One 
individual, was shown to recombine with TGLA44 and BULGE23, but not with the more distal 
markers, therefore positioning the mh locus distal from TGLA44 and BULGE23. Given the 
relative position of these microsatellite markers with respect to INPP1 and Col3AI as deduced 
from the integration of the human and bovine map, these results indicated that the mh gene is 
25 likely located in a chromosome segment bounded by INPP1 and Col3AI. 

Recently, McPherron etai (1997) demonstrated that mice homozygous for a 
knock-out deletion of GDF-8 or myostatin, were characterized by a generalized increase in 
skeletal muscle mass. Using the published 2676bp murine myostatin cDNA sequence 
(GenBank accession number U84005), a Tentative Human Consensus (THC) cluster in the 
30 Unigene database was identified which represented three cDNA clones (221299, 300367, 
308202) and six EST (Expressed Sequence Tag) sequences (H92027, H92028, N80248, 
N95327, W07375, W24782). The corresponding THC covered 1296 bp of the human myostatin 
gene, showing an homology of 78.1 % with the murine sequence when averaged over the entire 
sequence, and 91.1% when considering only the translated parts of the human and murine 
35 genes (566bp). This THC therefore very likely corresponds to the human orthologue of the 

murine myostatin gene. Primers (5-GGCCCAACTATGGATATATTTG-3' (SEQ ID NO:9) and 5'- 
GGTCCTGGGAAGGTTACAGCA-3* (SEQ ID NO:10)) were thus prepared to amplify a 272 bp 
fragment from the second exon of human myostatin and used to genotype the whole-genome 
Genebridge-4 radiation hybrid panel (Walter et a/., 1994). The RHMapper program (Slonim et 
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a/. f unpublished) was used to position the myostatin gene with respect to the Whitehead/MIT 
framework radiation hybrid map, placing it at position 948.7 cR of the HSA2 map with an 
associated iodscore > 3. Closer examination of the myostatin segregation vector and its 
confrontation with the vectors from all markers located in that region (Data Release 1 1 .9 t May 
5 1997) showed it to be identical to EST SGC38239 placed on the Whitehead/MIT radiation hybrid 
map (Hudson et a/. f 1 995) at position 946.8 cR of HSA2. This places the human myostatin gene 
on the RH-map in the interval between Col3AI (EST WI16343 - 942.5 cR) and INPP1 (EST 
L08488 - 950.2 to 951 .2 cR)(Figure 1). Myostatin therefore appeared as a very strong positional 
candidate for the mh gene. 

10 To test the potential involvement of myostatin in the determinism of double- 

muscling in cattle, primer pairs were designed based on the available mouse and human 
myostatin sequence, with the objective to amplify the entire coding sequence from bovine cDNA 
using PCR (Polymerase Chain Reaction). Whenever possible, primers were therefore positioned 
in portions of the myostatin sequence showing 100% homology between mouse and human. 

1 5 Two primer pairs were identified that amplified what was predicted to represent 98.4% of the 
bovine coding sequence plus 74 bp of 3' untranslated sequence, in two overlapping DNA 
fragments, respectively 660 (primers GDF8.19 - GDF8.12) and 724 bp (primers GDF8.1 1 - 
GDF8.21) long. The expected DNA products were successfully amplified from cDNA 
generated from skeletal muscle of both a normal (homozygous +/+) (SEQ ID NO:1) and a 

20 double-muscled (homozygous mh/mh) (SEQ ID NO:3) animal, and cycle-sequenced on both 
strands. 

The nucleotide sequence corresponding to the normal allele presented 88.9% 
identity with the mouse myostatin sequence (SEQ ID NO:5) over a 1067 bp overlap, and 
contained the expected open reading frame encoding a protein (SEQ ID NO:2) showing 92.9% 

25 identity in a 354 amino-acid overlap with mouse myostatin (SEQ ID NO:6). As expected for a 
member of the TGFp superfamily, the bovine myostatin gene is characterized by a proteolytic 
processing site thought to mediate cleavage of the bioactive carboxy-terminal domain from the 
longer N-terminal fragment, and by nine cysteine residues separated by a characteristic spacing 
and suspected to be involved in intra- and inter-molecular disulfide bridges (McPherron and Lee, 

30 1996). 

The nucleotide sequence obtained from the mh allele was identical to the 
normal allele over its entire length, except for an 1 1 bp deletion involving nucleotides 821 to 831 
(counting from the initiation codon). This frame shifting deletion, occurring after the first cysteine 
residue of the carboxy-terminal domain, drastically disrupts the downstream amino-acid 
35 sequence and reveals a premature stop-codon after 13 amino acids, see Figure 2. The amino 
acid sequence encoded by the mutant nucleic acid sequence is identified as SEQ ID NO:4. This 
mutation disrupts the bioactive part of the molecule and is therefore very likely to be the cause of 
the recessive double-muscling phenotype. Following conventional nomenclature, this mutation 
will be referred to as nt821(dei11). 
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To further strengthen the assumption of the causality of this mutation, primer 
pairs flanking the deletion (Figure 2) were prepared and the corresponding DNA segment from 
all animals from the experimental backcross population amplified. PCR was performed in the 
presence of dCTP 32 in order to radioactively label the amplification product. Amplification 

5 products were separated on denaturing polyacrylamide gels and detected by autoradiography. A 
1 88 bp product would be expected for the normal allele and a 1 77 bp product for the 
nt821(del11) allele. Correlation between phenotype and genotype was matched for the entire 
pedigree. All ten BBCB double-muscled sires were found to be homozygous for the 
nt821(dei11) mutation, all 41 F1 females were heterozygous, while 53 double-muscled offspring 

1 0 were homozygous for the mutation, the remaining 55 conventional animals were heterozygous. 

To examine the distribution of the nt821(del11) mutation in different conventional 
and double-muscled breeds, a cohort of 25 normal individuals was genotyped representing two 
dairy breeds (Holstein-Friesian, Red-and -White) and a cohort of 52 double-muscled animals 
representing four breeds (BBCB, Asturiana, Maine-Anjou and Piemontese). The results are 

1 5 summarized in Table 2. All dairy animals were homozygous normal except for one Red-and- 
White bull shown to be heterozygous. The occurrence of a small fraction of individuals carrying 
the mutation in dairy cattle is not unexpected as the phenotype is occasionally described in this 
breed, in BBCB and Asturiana, all double-muscled animals were homozygous for the 
nt821(del11) deletion, pointing towards allelic homogeneity in these two breeds. Double- 

20 muscled Maine-Anjou and Piemontese animals were homozygous "normal", i.e. they did not 
show the nt821(del1 1) deletion but a distinct cysteine to tyrosine substitution (C313Y) in double- 
muscled Piedmontese animals identified by others (Kambadur et a/., 1997) was discovered. 



Table 2: 



25 



Breed 


Phenotype 


Genotype 

+/+ +/nt821(del11) ntB21(del11)/nt821(del11) 


Belgian Blue 


DM 






29 


Asturiana 


DM 






10 


Piemontese 


DM 


8 






Maine-Anjou 


DM 


4 






Holstein-Friesian 


Normal 


13 






Red-and-White 


Normal 


12 


1 





The entire coding sequence was also determined for the myostatin gene in 
double-muscled individuals from ten European cattle breeds and a series of mutations that 
35 disrupt myostatin function were identified. 
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The coding sequence of four control Holstein-Friesian and Jersey individuals 
was identical to the previously described wild-type allele (Grobet et aL, 1 997), further indicating 
that it was the genuine myostatin coding sequence being amplified, and not a non-functional 
pseudogene. 

5 Amongst the 32 double-muscled animals, seven DNA sequence variants within 

the coding region were found, as summarized in Figure 3. 

In addition to the nt821(del11) mutation in the third exon, described above, four 
new mutations that would be expected to disrupt the myostatin function were found. An 
insertion/deletion at position 419 counting from the initiation codon, replacing 7 base pairs with an 

10 apparently unrelated stretch of 10 base pairs, reveals a premature stop codon in the N-terminal 
latency-associated peptide at amino-acid position 140. This mutation is referred to as 
nt419(de!7-ins10). Two base pair substitutions in the second exon, a C— »T transition at 
nucleotide position 610 and a G— *T transversion at nucleotide position 676, each yield a 
premature stop codon in the same N-terminal latency-associated peptide at amino-acid positions 

15 204 and 226 respectively. These mutations are called Q204X and E226X respectively. Finally, a 
G-*A transition at nucleotide position 938 results in the substitution of a cysteine by a tyrosine. 
This mutation is referred to as C313Y. This cysteine is the fifth of nine highly conserved cysteine 
residues typical of the members of the TGF-p superfamily and shared in particular by TGF-pi , - 
P2 and -p3, and inhibin-pA and -pB (McPherron & Lee, 1 996). It is thought to be involved in an 

20 intramolecular disulfide bridge stabilizing the three-dimensional conformation of the bioactive 
carboxyterminal peptide. Its substitution is therefore likely to affect the structure and function of 
the protein. This C313Y has recently also been described by Kambadur et al. (1997). 

A conservative phenylalanine to leucine substitution was also found at amino- 
acid position 94 in the first exon, due to a C->A transversion at nucleotide position 282 of the 

25 myostatin gene. Given the conservative nature of the amino-acid substitution, its location in the 
less conserved N-terminal latency-associated peptide, and as this mutation was observed at the 
homozygous condition in animals that were not showing any sign of exceptional muscular 
development, this mutation probably does not interfere drastically with the myostatic function of 
the encoded protein, if at all. This mutation is referred to as F94L The murine protein is 

30 characterized by a tyrosine at the corresponding amino-acid position. 

Also identified was a silent C—^T transition at the third position of the 138th 
cytosine codon in the second exon, referred to as nt414(C-T). 

In addition to these DNA sequence polymorphisms detected in the coding region 
of the myostatin gene, also found were four DNA sequence variants in intronic sequences which 

35 are probably neutral polymorphisms and which have been assigned the following symbols: 

nt374-51(T-C), nt374-50(G-A), nt374-16(de!1) in intron 1 , and nt748-78(de!1) in intron 2 (Figure 
3). 

Figure 4 shows the observed distribution of mutations in the analysed sample 
sorted by breed. For the majority of the studied breeds, the analyzed double-muscled animals 
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were homozygous for one of the five described mutations expected to disrupt the myostatin 
function or compound heterozygotes for two of these mutations. This is compatible with the 
hypothesis that the double-muscled condition has a recessive mode of inheritance in all these 
breeds. 

5 Only in Limousin and Blonde d'Aquitaine was there no clear evidence for th 

role of myostatin loss-of-function mutations in the determinism of the observed muscular 
hypertrophy. Most Limousin animals were homozygous for the conservative F94L substitution 
which is unlikely to cause the muscular hypertrophy characterizing these animals, as discussed 
above. One Limousin animal proved to be heterozygous for this mutation, the other allele being 

10 the "wild-type" one. All Blonde d'Aquitaine animals were homozygous "wild-type". These data 
indicate either that the myostatin gene is possibly not involved in the double-muscled condition 
characterizing these two breeds, or that there are additional myostatin mutations outside of the 
coding region. The double-muscling condition is often considered to be less pronounced in 
Limousin animals compared to other breeds. 

15 The data indicate that some mutations, such as the nt821del(11) and C313Y, 

are shared by several breeds which points towards gene migration between the corresponding 
populations, while others seems to be confined to specific breeds. Moreover, while some breeds 
(the Belgian Blue breed in particular) seem to be essentially genetically homogeneous others 
show clear evidence for allelic heterogeneity (e.g. Maine-Anjou). 

20 The observation of allelic heterogeneity contradicts with the classical view that a 

single mh mutation spread through the European continent in the beginning of the 19th century 
with the dissemination of the Shorthorn breed from the British Isles (Menissier, 1982). Two of 
the mutations at least are shared by more than one breed, indicating some degree of gene 
migration but definitely not from a single origin. 

25 in mice, and in addition to the in vitro generated myostatin knock-out mice 

(McPherron & Lee, 1997), the compact mutation could be due to a naturally occurring mutation 
at the myostatin gene. The compact locus has been mapped to the D1Mit375-D1Mit21 interval 
on mouse chromosome 1 known to be orthologous to HSA2q31-32 and BTA2q12-22 (Varga et 
a/., 1997). 

30 From an applied point of view, the characterisation of a panel of mutations in 

the myostatin gene associated with double-muscling contributes to the establishment of a 
diagnostic screening system allowing for marker assisted selection for or against this condition in 
cattle. 

Example 1 
35 Genetic and physical mapping 

Integration of the HSA2q31-32 and BTA2q12-22 maps was done by using 
coincident YAC's and the mh locus was positioned in the interval flanked by Col3AI and INPP1 as 
follows. Genetic mapping was performed using a previously described (Holstein-Friesian x 
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Belgian Blue) x Belgian Blue experimental backcross population counting 108 informative 
individuals (Charfier et a/., 1 995). Microsatellite genotyping was performed according to standard 
procedures (Georges et a/., 1995), using the primer sequences reported in Table 1 . Linkage 
analyses were performed with the MLINK, ILINK and LINKMAP programs of the LINKAGE 
5 (version 5.1) and FASTLINK (2.3P version, June 1995) packages (Lathrop & Lalouel, 1984; 
Cottingham et a/., 1993). The YAC library was screened by PCR using a three dimensional 
pooling scheme as described in Libert et a/., 1993. The primer pairs corresponding to the CATS 
used to screen the library are reported in Table 1 . Cross-hybridisation between SINE-PCR 
products of individual YACs was performed according to Hunter et al. (1 996), using primers 
1 0 reported in Lenstra et al. (1 993). Microsatellites were isolated from YACs according to Cornelis 
etal (1992). 

Example 2 

Mapping of the human myostatin gene on the Genebridge-4-panel 

DNA from the Genebridge-4 panel (Walter et a/. f 1994) was purchased from 
15 Research Genetics (Huntsville, Alabama), and genotyped by PCR using standard procedures 
and the following human myostatin primer pair (S'-GGCCCAACTATGGATATATTTG-S' and 5- 
G GTC CTG G G AAG GTTAC AG C A-3*) . Mapping was performed via the WWW server of the 
Whitehead Institute/MIT Center for Genome Research using their RH-mapper program (Slonim, 
D.; Stein, L.; Kruglyak, L.; Lander, E M unpublished) to position the markers with respect to the 
20 framework map. Segregation vectors of the query markers were compared with the vectors from 
ail markers in the region of interest in the complete Data Release 1 1 .9 (May 1997) to obtain a 
more precise position. This positions myostatin in the INPP1-Col3AI on the human map with LOD 
score superior to 3. 

Example 3 
25 RT-PCR 

To clone the bovine myostatin orthologue a strategy based on RT-PCR 
amplification from skeletal muscle cDNA was chosen. Total RNA was extracted from skeletal 
muscle (Triceps brachiaiis) according to Chirgwin etal. (1979). RT-PCR was performed using 
the Gene-Amp RNA PCR Kit (Perkin-Elmer) and the primers reported in Table 1 . The PCR 
30 products were purified using QiaQuick PCR Purification kit (Qiagen) and sequenced using Dye 
terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic 
sequencer, using the primers reported in Table 2. 

Example 4 

Diagnosis of the nt821(deI11) deletion 

35 To diagnose the nt821(del1 1) the following primer sequences were designed 

flanking the nt821(de!11) deletion: 5-TCTAG G AG AG ATTTTG G GCTT-3' (SEQ ID NO:53) and 5- 
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GATGGGTATGAGGATAC Mil GC-3' (SEQ ID NO:52). These primers amplify a 188 bp 
fragments from normal individuals and a 177bp fragment from double-muscled individuals. 
Heterozygous individuals show the two amplification products. These amplification products can 
be detected using a variety of methods. In this example the PCR product was labelled by 

5 incorporation of dCTP 32 , separated on a denaturing acrylamide gel and revealed by 
autoradiography. Other approaches that could be used to distinguish the three different 
genotypes are known to those skilled in the art and would include separation in agarose gels and 
visualization with ethidium bromide, direct sequencing, TaqMan assays, hybridization with allele 
specific oligonucleotides, reverse dot-blot, RFLP analysis and several others. The specificity of 

1 0 the test is linked to the detected mutation and not to the primers used in the detection method. 
That means that other primers can easily be designed based on said bovine myostatin sequence 
that would fulfill the same requirements. 

Example 5 

Determination of mutations in other breeds 

15 A total of 32 animals with extreme muscular development were sampled from 

ten European beef cattle breeds in which double-muscled animals are known to occur at high to 
moderate frequency: (i) Belgium: Belgian Blue (4), (ii) France: Blonde d'Aquitaine (5), Charolais 
(2), Gasconne (2), Limousin (5), Maine-Anjou (4), Parthenaise (3), (Hi) Spain: Asturiana (2), 
Rubia Gallega (2), (iv) Italy: Piedmontese (2). The determination of the double-muscled 

20 phenotype of the sampled animals was performed visually by experienced observers. Four 

animals with conventional phenotype sampled from the Holstein-Friesian (2) and Jersey (2) dairy 
populations were analysed as controls. 

In order to facilitate the study of the myostatin coding sequence from genomic 
DNA, the sequences of the exon-intron boundaries of the bovine gene were determined. In mice, 

25 the myostatin gene is known to be interrupted by two introns, respectively * 1 .5 and 2.4 Kb long 
(McPherron & Lee, 1997). Two primer pairs were thus designed, respectively, in bovine exons 1 
and 2, and exons 2 and 3, that were predicted to flank the two introns, assuming conservation of 
gene organisation between mouse and cattle (Figure 3 and Table 3). Using these primer sets, 
two PCR products respectively 2Kb and 3.5Kb long were generated from a YAC clone (179A3) 

30 containing the bovine myostatin gene (Grobet et a/., 1997). The PCR products were purified 
using QiaQuick PCR Purification kit (Qiagen) and partially sequenced using Dye terminator Cycle 
Sequencing Ready Reaction (Perkin-Elmer) and an ABI373 automatic sequencer. Alignment 
with the bovine cDNA sequence identified the four predicted exon-intron boundaries. The 
nucleotide sequence corresponding to bovine genomic DNA is identified as SEQ ID NO:54. 
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Table 3: Prim rs used for PCR amplification and cycl sequencing. 



lntron1-5' 


5*~GAAGACGATGACTACCAC 
GCCAGGACG-3* 


lntron1-3* 


5--CTAGTTTATTGTATTGTATCTT 
AGAGC-3* 


Intrortf-S* 


S'-AGACTCCTACAACAGTGT 
TTGT-3* 


!ntron2-3* 


5-ATACTCWAGGCCTAYAGCCT 
GTGGT-3' 


Exon1-5* 


5*-ATTCACTGGTGTGGCAAG 
TTGTCTCTCAGA-3 1 


Exon1-3' 


5'.CCCTCCTCCTTACATACAAGC 
CAGCAG-3* 


Exon2-5* 


5-GTTCATAGATTGATATGG 
AGGTGTTCG-3' 


Exon2-3' 


S'-ATAAGCACAGGAAACTGGTAG 
TTATT*3' 


Exon3-5' 


S-GAAATGTGACATAAGCAA 
AATGATTAG-3' 


Exon3-3* 


5-ATACTCWAGGCCTAYAGCCT 
GTGGT-3' 


Exon1-Seq1 


S'-TTGAGGATGTAGTGTTTT 
CC-3' 


Exonl -Seq2 


5-GCCATAAAAATCCAAATCCTC 
AG-3 1 


Exon2-Seq1 


5-CATTTATAGCTGATCTTC 
TAACGCAAG-3* 


Exon2-Seq2 


5'-TGTCGCAGGAGTCTTGACAG 
GCCTCAG-3' 


Exon2-Seq3 


5-GTACAAGGTATACTGGAA 
TCCGATCTC-3* 






Exon3-Seq1 


5-AGCAGGGGCCGGCTGAA 
CCTCTGGG-3 1 


Exon3-Seq2 


5'-CCCCAGAGGTTCAGCCGGCC 
CCTGC-3' 



Based on the available exonic and intronic sequences of the bovine myostatin 
gene, three primer pairs that jointly allow for convenient amplification of the entire coding 
sequence from genomic DNA were designed. The position of the corresponding primers is 
shown in Figure 3, and the corresponding sequences are reported in Table 3. 

15 After PCR amplification of the entire coding sequence from genomic DNA in the 

three described fragments, these were purified using QiaQuick PCR Purification kit (Qiagen) and 
sequenced using Dye terminator Cycle Sequencing Ready Reaction (Perkin-Elmer) and an 
ABI373 automatic sequencer, using the primers used for amplification as well as a series of 
nested primers (Figure 3 and Table 3). Chromat files produced with the ABI373 sequencer were 

20 analysed with the Polyphred application (D. Nickerson, personal communication), which is part of 
a series of sequence analysis programs including Phred (Ewing, B. & Green, P. (1992), 
unpublished), Phrap (Green, P. (1994), unpublished) and Consed (Gordon, D. (1995), 
unpublished), but any suitable sequencing programme would do, as known to a person skilled in 
the art. 

25 Monoclonal antibodies (Mab's) specific for myostatin are useful. In the case of 

the bovine protein having the amino acid sequence identified as SEQ ID NO:2, for example, 
antibodies can be used for diagnostic purposes such as for determining myostatin protein levels 
in muscle tissue. To produce these antibodies, purified myostatin is prepared. The myostatin 
can be produced in bacterial cells as a fusion protein with glutathione-S-transferase using the 

30 vector pGEX2 (Pharmacia). This permits purification of the fusion protein by GSH affinity 
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chromatography. In another approach, myostatin is expressed as a fusion protein with the 
bacterial maltose binding domain. The fusion protein is thus recovered from bacterial extracts by 
passing the extract over an amylose resin column followed by elution of the fusion protein with 
maltose. For this fusion construct, the vector pMalC2, commercially available from New England 
5 Biolabs, can be used. The preparation of a second fusion protein is also useful in the preliminary 
screening of MAb's. 

The generation of hybridomas expressing monoclonal antibodies recognizing 
myostatin protein is carried out as follows: BALB/c mice are injected intraperitoneal^ with 
protein/adjuvant three times at one-month intervals, followed by a final injection into the tail vein 
10 shortly prior to cell fusion. Spleen cells are harvested and fused with NS-1 myeloma cells 
(American Type Culture Collection, Rockville, MD) using polyethylene glycol 4000 according to 
standard protocols (Kennett, 1979; Mirski, 1989). The cell fusion process is carried out as 
described in more detail below. 

The fused cells are plated into 96-well plates with peritoneal exudate cells and 
15 irradiated spleen cells from BALB/Ccmice as feeder layers and selection with hypoxanthine, 
aminopterin, and thymidine (HAT medium) is performed. 

An ELISA assay is used as an initial screening procedure. 1 -1 0 pg of purified 
myostatin (cleaved from the fusion protein) in PBS is used to coat individual wells, and 50-100 pi 
per well of hybridoma supernatants is incubated. Horseradish peroxidase-conjugated anti- 
20 mouse antibodies are used for the colorimetric assay. 

Positive hybridomas are cloned by limiting-dilution and grown to large-scale for 
freezing and antibody production. Various positive hybridomas are selected for usefulness in 
western blotting and immunohistochemistry, as well as for cross reactivity with myostatin proteins 
from different species, for example the mouse and human proteins. 
25 Alternatively, active immunization by the generation of an endogenous antibody 

by direct exposure of the host animal to small amounts of antigen can be carried out. Active 
immunization involves the injection of minute quantities of antigen (g) which probably will not 
induce a physiological response and will be degraded rapidly. Antigen will only need to be 
administered as prime and boost immunizations in much the same manner as techniques used 
30 to confer disease resistance (Pell et a/., 1 997). 

Antisense nucleic acids or oligonucleotides (RNA or preferably DNA) can be 
used to inhibit myostatin production in order to increase muscle mass of an animal. Antisense 
oligonucleotides, typically 1 5 to 20 bases long, bind to the sense mRNA or pre mRNA region 
coding for the protein of interest, which can inhibit translation of the bound mRNA to protein. The 
35 cDNA sequence encoding myostatin can thus be used to design a series of oligonucleotides 
which together span a large portion, or even the entire cDNA sequence. These oligonucleotides 
can be tested to determine which provides the greatest inhibitory effect on the expression of the 
protein (Stewart, 1996). The most suitable mRNA target sites include 5 - and 3-untranslated 
regions as well as the initiation codon. Other regions might be found to be more or less effective. 
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Alternatively, an antisense nucleic acid or oligonucleotide may bind to myostatin coding or 
regulatory sequences. 

Rather than reducing myostatin activity by inhibiting myostatin gene expression 
at the nucleic acid level, activity of the myostatin protein may be directly inhibited by binding to an 
5 agent, such as, for example, a suitable small molecule or a monoclonal antibody. 

It will of course be understood, without the intention of being limited thereby, that 
a variety of substitutions of amino acids is possible while preserving the structure responsible for 
myostatin activity of the proteins disclosed herein. Conservative substitutions are described in the 
patent literature, as for example, in United States Patent No. 5,264,558 or 5,487,983. It is thus 

10 expected, for example, that interchange among non-polar aliphatic neutral amino acids, glycine, 
alanine, proline, valine and isoleucine, would be possible. Likewise, substitutions among the 
polar aliphatic neutral amino acids, serine, threonine, methionine, asparagine and glutamine 
could possibly be made. Substitutions among the charged acidic amino acids, aspartic acid and 
glutamic acid, could probably be made, as could substitutions among the charged basic amino 

15 acids, lysine and arginine. Substitutions among the aromatic amino acids, including 

phenylalanine, histidine, tryptophan and tyrosine would also likely be possible. These sorts of 
substitutions and interchanges are well known to those skilled in the art. Other substitutions 
might well be possible. Of course, it would also be expected that the greater the percentage of 
homology, i.e., sequence similarity, of a variant protein with a naturally occurring protein, the 

20 greater the retention of metabolic activity. Of course, as protein variants having the activity of 
myostatin as described herein are intended to be within the scope of this invention, so are 
nucleic acids encoding such variants. 

A further advantage may be obtained through chimeric forms of the protein, as 
known in the art. A DNA sequence encoding the entire protein, or a portion of the protein, could 

25 thus be linked, for example, with a sequence coding for the C-terminal portion of £. cofi B- 

galactosidase to produce a fusion protein. An expression system for human respiratory syncytial 
virus glycoproteins F and G is described in United States Patent No. 5,288,630 issued February 
22, 1994 and references cited therein, for example. 

A recombinant expression vector of the invention can be a plasmid, as described 

30 above. The recombinant expression vector of the invention further can be a virus, or portion 
thereof, which allows for expression of a nucleic acid introduced into the viral nucleic acid. For 
example, replication defective retroviruses, adenoviruses and adeno-associated viruses can be 
used. 

The recombinant expression vectors of the invention can be used to make a 
35 transformant host cell including the recombinant expression vector. The term "transformant host 
cell" is intended to include prokaryotic and eukaryotic cells which have been transformed or 
transfected with a recombinant expression vector of the invention. The terms "transformed with", 
"transfected with", "transformation" and Iransfection" are intended to encompass introduction of 
nucleic acid (e.g. a vector) into a cell by one of many possible techniques known in the art. 
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Prokaryotic cells can be transformed with nucleic acid by, for example, electroporation or 
calcium-chloride mediated transformation. Nucleic acid can be introduced into mammalian cells 
via conventional techniques such as calcium phosphate or calcium chloride coprecipitation, 
DEAE-dextran-mediated transfection, lipofection, electroporation or microinjection. Suitable 
5 methods for transforming and transfecting host cells are known (Sambrook, 1 989). 

The number of host cells transformed with a recombinant expression vector of 
the invention by techniques such as those described above will depend upon the type of 
recombinant expression vector used and the type of transformation technique used. Plasmid 
vectors introduced into mammalian cells are integrated into host cell DNA at only a low 
10 frequency. In order to identify these integrants, a gene that contains a selectable marker (e.g. 
resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. 
Preferred selectable markers include those which confer resistance to certain drugs, such as 
G418 and hygromycin. Selectable markers can be introduced on a separate plasmid from the 
nucleic acid of interest or, preferably, are introduced on the same plasmid. Host cells 
1 5 transformed with one or more recombinant expression vectors containing a nucleic acid of the 
invention and a gene for a selectable marker can be identified by selecting for cells using the 
selectable marker. For example, if the selectable marker encodes a gene conferring neomycin 
resistance (such as pRc/CMV), transformant cells can be selected with G418. Cells that have 
incorporated the selectable marker gene will survive, while the other cells die. 
20 Nucleic acids which encode myostatin proteins can be used to generate 

transgenic animals. A transgenic animal (e.g., a mouse) is an animal having cells that contain a 
transgene, which transgene is introduced into the animal or an ancestor of the animal at a 
prenatal, e.g., an embryonic stage. A transgene is a DNA which is integrated into the genome of 
a cell from which a transgenic animal develops. In one embodiment, a bovine cDNA, comprising 
25 the nucleotide sequence shown in SEQ ID NO:1 , or an appropriate variant or subsequence 
thereof, can be used to generate transgenic animals that contain cells which express bovine 
myostatin. Likewise, variants such as mutant genes (e.g. SEQ ID NO:3) can be used to generate 
transgenic animals. This could equally well be done with the human myostatin protein and 
variants thereof. "Knock out" animals, as described above, can also be generated. Methods for 
30 generating transgenic animals, particularly animals such as mice, have become conventional in 
the art are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009. In a preferred 
embodiment, plasmids containing recombinant molecules of the invention are microinjected into 
mouse embryos. In particular, the plasmids are microinjected into the male pronuclei of fertilized 
one-cell mouse eggs; the injected eggs are transferred to pseudo-pregnant foster females; and, 
35 the eggs in the foster females are allowed to develop to term. (Hogan, 1986). Alternatively, an 
embryonal stem cell line can be transfected with an expression vector comprising nucleic acid 
encoding a myostatin protein, and cells containing the nucleic acid can be used to form 
aggregation chimeras with embryos from a suitable recipient mouse strain. The chimeric 
embryos can then be implanted into a suitable pseudopregnant female mouse of the appropriate 
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strain and the mbryo brought to term. Progeny harboring the transfected DNA in their germ 
cells can be used to breed uniformly transgenic mice. 

Such animals could be used to determine whether a sequence related to an 
intact myostatin gene retains biological activity of myostatin. Thus, for example, mice in which 
5 the murine myostatin gene has been knocked out and containing the nucleic acid sequence 
identified as SEQ ID NO:1 could be generated along with animals containing the nucleic acid 
sequence identified as SEQ ID NO:3. The animals could be examined for display of muscular 
hyperplasia, especially in comparison with knockout mice, which are known to display such. In 
this way it can be shown that the protein encoded by SEQ ID NO:3 lacks myostatin activity within 

10 the context of this invention while the protein encoded by the nucleic acid sequence identified as 
SEQ ID NO:1 possesses biological activity of myostatin. 

In such experiments, muscle cells would be particularly targeted for myostatin 
(and variants) transgene incorporation by use of tissue specific enhancers operatively linked to 
the encoding gene. For example, promoters and/or enhancers which direct expression of a 

15 gene to which they are operatively linked preferentially in muscle cells can be used to create a 
transgenic animal which expresses a myostatin protein preferentially in muscle tissue. 
Transgenic animals that include a copy of a myostatin transgene introduced into the germ line of 
the animal at an embryonic stage can also be used to examine the effect of increased myostatin 
expression in various tissues. 

20 The pattern and extent of expression of a recombinant molecule of the invention 

in a transgenic mouse is facilitated by fusing a reporter gene to the recombinant molecule such 
that both genes are co-transcribed to form a polycistronic mRNA. The reporter gene can be 
introduced into the recombinant molecule using conventional methods such as those described 
in Sambrook etal. t (Sambrook, 1989). Efficient expression of both cistrons of the polycistronic 

25 mRNA encoding the protein of the invention and the reporter protein can be achieved by 

inclusion of a known internal translational initiation sequence such as that present in poliovirus 
mRNA. The reporter gene should be under the control of the regulatory sequence of the 
recombinant molecule of the invention and the pattern and extent of expression of the gene 
encoding a protein of the invention can accordingly be determined by assaying for the phenotype 

30 of the reporter gene. Preferably the reporter gene codes for a phenotype not displayed by the 
host cell and the phenotype can be assayed quantitatively. Examples of suitable reporter genes 
include lacZ (P-galactosidase), neo (neomycin phosphotransferase), CAT (chloramphenicol 
acetyltransferase) dhfr (dihydrofolate reductase), aphlV (hygromycin phosphotransferase), lux 
(luciferase), uidA (p-glucuronidase). Preferably, the reporter gene is lacZ which codes for p- 

35 galactosidase. p-galactosidase can be assayed using the lactose analogue X-gal (5-bromo-4- 
chloro-3-indolyl-b-D-galactopyranoside) which is broken down by p-galactosidase to a product 
that is blue in color (Old). 

The present invention includes knocking out wild type myostatin in mammals, in 
order to obtain the desired effect(s) thereof. This is particularly true in cattle raised for beef 
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production. It may well prove advantageous to substitute a defective gene (e.g. SEQ ID NO:3 or 
it genomic analogue) rather than delete the entire sequence of DNA encoding for a protein 
having myostatin activity. A method of producing a transgenic bovine or transgenic bovine 
embryo is described in United States Patent No, 5,633,076, issued May 27, 1997, for example. 

5 The transgenic animals of the invention can be used to investigate the molecular 

basis of myostatin action. For example, it is expected that myostatin mutants in which one or 
more of the conserved cysteine residues has been deleted would have diminished activity in 
relation to a wild type myostatin protein in which all such residues are retained. Further, deletion 
of proteolytic cleavage site would likely result in a mutant lacking biological activity of myostatin. 

1 o Transgenesis can be used to inactivate myostatin activity. This could be 

achieved using either conventional transgenesis, i.e. by injection in fertilized oocytes, or by gene 
targeting methods using totipotent cell lines such as ES (embryonic stem cells) which can then 
be injected in oocytes and participate in the development of the resulting organisms or whose 
nucleus can be transferred into unfertilized oocytes, nucleus transfer or cloning. 

15 it is also possible to create a genetically altered animal in which the double- 

muscling trait is dominant so that the animal would be more useful in cross-breeding. Further, in 
a particular aspect, the dominant trait would be male specific. In this way, bulls would be double- 
muscled but cows would not be. In addition, or alternatively, the trait would also be unexpressed 
until after birth or inducible. If inducible the trait could be induced after birth to avoid the calving 

20 difficulties described above. 

There are at least three approaches that can be taken to create a dominant 
u mh n allele. Because functional myostatin, a member of the TGF-G superfamily, is a dimer, 
dominant negative myostatin mutations can be created (Herskowitz et a/., 1987; Lopez et a/., 
1992). According to one method, this is accomplished by mutating the proteolytic processing 

25 site of myostatin. To enhance the dominant negative effect, the gene can be put under the 
control of a stronger promoter such as the CMV promoter or that of a myosin gene, which is 
tissue specific, i.e., expressed only in skeletal muscle. Alternatively, an antisense sequence of 
that encoding myostatin could be incorporated into the DNA, so that complementary mRNA 
molecules are generated, as understood by a person skilled in the art. Optionally, a ribozyme 

30 could be added to enhance mRNA breakdown. In another approach, ere recombinase 

generate/ribozyme approach or the Cre-lox P system could be used (Hoess et a/., 1982; Gu et 
a/., 1994). 

Male specificity can be achieved by placing the dominant mh alleles on the Y 
chromosome by homologous recombination. 
35 Inducibility can be achieved by choosing promoters with post-natal expression in 

skeletal muscle or using inducible systems such at he Tet-On and Tet-Off systems could be 
used (Gossen et a/., 1992; Shockett et a/., 1996). 

Using conventional transgenesis a gene coding for a myostatin antisense is 
injected, for example, by inverting the orientation of the myostain gene in front of its natural 



WO 99/02667 



27- 



PCT/IB98/01197 



promoter and enhancer sequences. This is followed by injection of a gene coding for an anti- 
myostain ribozyme, i.e. an RNA that would specifically bind to endogenous myostain mRNA and 
destroy it via its "ribozyme" activity. 

Also, through gene targeting, a conventional knock-out animal can be 

5 generated, specific mutations by gene replacement can be engineered. It is possible to 

inactivate the myostain gene at a specific developmental time, such as after birth to avoid calving 
difficulties. As mentioned above, this could be achieved using the Cre-lox P systems in which 
1.ox P sides are engineered around the myostain gene by homologous recombination (gene 
targeting), and mating these animals with transgenic animals having a Cre transgene (coding for 

0 the Cre recombinase existing DNA flanked by J oxP sides) under the dependence of a skeletal 
muscle specific promoter only active after birth. This is done to obtain individuals that would 
inactivate their myostain gene after birth. As mentioned above, there are also gene targeting 
systems that allow genes to be turned on and off by feeding an animal with, for example, an 
antibiotic. In such an instance, one engineers an operator between the promoter of the gene and 

5 the gene itself. This operator is the target of a repressor which when binding inactivates the gene 
(for example, the lac operon in E. colt). The repressor is brought into the cell using conventional 
transgenesis, for example, by injection of the gene coding for the repressor. 

Transgenic animals of the invention can also be used to test substances for the 
ability to prevent, slow or enhance myostatin action. A transgenic animal can be treated with the 

0 substance in parallel with an untreated control transgenic animal. 

The antisense nucleic acids and oligonucleotides of the invention are useful for 
inhibiting expression of nucleic acids (e.g. mRNAs) encoding proteins having myostatin activity. 

The isolated nucleic acids and antisense nucleic acids of the invention can be 
used to construct recombinant expression vectors as described previously. These recombinant 

5 expression vectors are then useful for making transformant host cells containing the recombinant 
expression vectors, for expressing protein encoded by the nucleic acids of the invention, and for 
isolating proteins of the invention as described previously. The isolated nucleic acids and 
antisense nucleic acids of the invention can also be used to construct transgenic and knockout 
animals as described previously. 

0 The isolated proteins of the invention are useful for making antibodies reactive 

against proteins having myostatin activity, as described previously. Alternatively, the antibodies of 
the invention can be used to isolate a protein of the invention by standard immunoaffinity 
techniques. Furthermore, the antibodies of the invention, including bispecific antibodies are 
useful for diagnostic purposes. 

5 Molecules which bind to a protein comprising an amino acid sequence shown in 

SEQ ID NO:2 can also be used in a method for killing a cell which expresses the protein, wherein 
the cell takes up the molecule, if for some reason this were desirable. Destruction of such cells 
can be accomplished by labeling the molecule with a substance having toxic or therapeutic 
activity. The term "substance having toxic or therapeutic activity" as used herein is intended to 
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include molecules whose action can destroy a cell, such as a radioactive isotope, a toxin (e.g. 
diphtheria toxin or ricin), or a chemotherapeutic drug, as well as cells whose action can destroy a 
cell, such as a cytotoxic cell. The molecule binding to the myostatin can be directly coupled to a 
substance having a toxic or therapeutic activity or may be indirectly linked to the substance. In 
5 one example, the toxicity of the molecule taken up by the cell is activated by myostatin protein. 

The invention also provides a diagnostic kit for identifying cells comprising a 
molecule which binds to a protein comprising an amino acid sequence shown in SEQ ID NO:2, 
for example, for incubation with a sample of tumor cells; means for detecting the molecule 
bound to the protein, unreacted protein or unbound molecule; means for determining the amount 

10 of protein in the sample; and means for comparing the amount of protein in the sample with a 
standard. Preferably, the molecule is a monoclonal antibody. In some embodiments of the 
invention, the detectability of the molecule which binds to myostatin is activated by said binding 
(e.g., change in fluorescence spectrum, loss of radioisotopic label). The diagnostic kit can also 
contain an instruction manual for use of the kit. 

15 The invention further provides a diagnostic kit for identifying cells comprising a 

nucleotide probe complementary to the sequence, or an oligonucleotide fragment thereof, 
shown in SEQ ID NO:1 , for example, for hybridization with mRNA from a sample of cells, e.g., 
muscle cells; means for detecting the nucleotide probe bound to mRNA in the sample with a 
standard. In a particular aspect, the invention is a probe having a nucleic acid molecule 

20 sufficiently complementary with a sequence identified as SEQ ID NO:1 , or its complement, so as 
to bind thereto under stringent conditions. "Stringent hybridization conditions" takes on its 
common meaning to a person skilled in the art here. Appropriate stringency conditions which 
promote nucleic acid hybridization, for example, 6x sodium chloride/sodium citrate (SSC) at 
about 45°C are known to those skilled in the art. The following examples are found in Current 

25 Protocols in Molecular Biology, John Wiley & Sons, NY (1 989), 6.3.1 -6.3.6: For 50 ml of a first 
suitable hybridization solution, mix together 24 ml formamide, 12 ml 20x SSC, 0.5 ml 2 M 
Tris-HCI pH 7.6, 0.5 ml 100x Denhardt's solution, 2.5 ml deionized H 2 O f 10 ml 50% dextran 
sulfate, and 0.5 ml 10% SDS. A second suitable hybridization solution can be 1% crystalline 
BSA (fraction V), 1 mM EDTA, 0.5 M Na 2 HP0 4 pH 7.2, 7% SDS. The salt concentration in the 

30 wash step can be selected from a low stringency of about 2x SSC at 50°C to a high stringency of 
about 0.2x SSC at 50°C. Both of these wash solutions may contain 0.1 % SDS. In addition, the 
temperature in the wash step can be increased from low stringency conditions at room 
temperature, about 22°C, to high stringency conditions, at about 65°C. The cited reference 
gives more detail, but appropriate wash stringency depends on degree of homology and length 

35 of probe. If homology is 100%, a high temperature (65°C to 75°C) may be used. If homology is 
low, lower wash temperatures must be used. However, if the probe is very short (<100bp), lower 
temperatures must be used even with 100% homology. In general, one starts washing at low 
temperatures (37°C to 40°C), and raises the temperature by 3-5°C intervals until background is 
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low enough not to be a major factor in autoradiography. The diagnostic kit can also contain an 
instruction manual for use of the kit. 

The invention provides a diagnostic kit which can be used to determine the 
genotype of mammalian genetic material, for example. One kit includes a set of primers used 
5 for amplifying the genetic material. A kit can contain a primer including a nucleotide sequence 
for amplifying a region of the genetic material containing one of the naturally occurring mutations 
described herein. Such a kit could also include a primer for amplifying the corresponding region 
of the normal gene that produces functional myostatin. Usually, such a kit would also include 
another primer upstream or downstream of the region of interest complementary to a coding 

1 0 and/or non-coding portion of the gene. A particular kit includes a primer selected from a non- 
coding sequence of a myostatin gene. Examples of such primers are provided in Table 3, 
designated as Exon1-5', Exon1-3\ Exon2-5\ Exon3-5' and Exon3-3\ These primers are used to 
amplify the segment containing the mutation of interest. The actual genotyping is carried out 
using primers that target specific mutations described herein and that could function as allele- 

15 specific oligonucleotides in conventional hybridization, Taqman assays, OLE assays, etc. 
Alternatively, primers can be designed to permit genotyping by microsequencing. 

One kit of primers thus includes first, second and third primers, (a), (b) and (c), 
respectively. Primer (a) is based on a region containing a myostatin mutation, for example a 
region of the myostatin gene spanning the nt821del(11) deletion. Primer (b) encodes a region 

20 upstream or downstream of the region to be amplified by primer (a) so that genetic material 
containing the mutation is amplified, by PCR, for example, in the presence of the two primers. 
Primer (c) is based on the region corresponding to that on which primer (a) is based, but lacking 
the mutation. Thus, genetic material containing the non-mutated region will be amplified in the 
presence of primers (b) and (c). Genetic material homozygous for the wild type gene will thus 

25 provide amplified products in the presence of primers (b) and (c). Genetic material homozygous 
for the mutated gene will thus provide amplified products in the presence of primers (a) and (b). 
Heterozygous genetic material will provide amplified products in both cases. 

The invention provides purified proteins having biological activity of myostatin. 
The terms "isolated" and "purified" each refer to a protein substantially free of cellular material or 

30 culture medium when produced by recombinant DNA techniques, or chemical precursors or 
other chemicals when chemically synthesized. In certain preferred embodiments, the protein 
having biological activity of myostatin comprises an amino acid sequence identified as SEQ ID 
NO:2. Furthermore, proteins having biological activity of myostatin that are encoded by nucleic 
acids which hybridize under stringent conditions, as discussed above, to a nucleic acid 

35 comprising a nucleotide sequence identified as SEQ ID NO:1 or SEQ ID NO:7 are encompassed 
by the invention. Proteins of the invention having myostatin activity can be obtained by expression 
in a suitable host ceil using techniques known in the art. Suitable host cells include prokaryotic 
or eukaryotic organisms or cell lines, for example, yeast, E. co//, insect cells and COS 1 cells. 
The recombinant expression vectors of the invention, described above, can be used to express a 
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protein having myostatinl activity in a host cell in order to isolate the protein. The invention 
provides a method of preparing an purified protein of the invention comprising introducing into a 
host cell a recombinant nucleic acid encoding the protein, allowing the protein to be expressed in 
the host cell and isolating and purifying the protein. Preferably, the recombinant nucleic acid is a 

5 recombinant expression vector. Proteins can be isolated from a host cell expressing the protein 
and purified according to standard procedures of the art, including ammonium sulfate 
precipitation, column chromatography (e.g. ion exchange, gel filtration, affinity chromatography, 
etc.), electrophoresis, and ultimately, crystallization (see generally, "Enzyme Purification and 
Related Techniques", Methods in Enzymology, 22, 233-577 (1971)). 

10 Alternatively, the protein or parts thereof can be prepared by chemical synthesis 

using techniques well known in the chemistry of proteins such as solid phase synthesis 
(Merrifield, 1964), or synthesis in homogeneous solution (Houbenwcyl, 1987). 

The protein of the invention, or portions thereof, can be used to prepare 
antibodies specific for the proteins. Antibodies can be prepared which bind to a distinct epitope in 

15 an unconserved region of a particular protein. An unconsented region of the protein is one which 
does not have substantial sequence homology to other proteins, for example other members of 
the myostatin family or other members of the TGFp superfamily. Conventional methods can be 
used to prepare the antibodies. For example, by using a peptide of a myostatin protein, 
polyclonal antisera or monoclonal antibodies can be made using standard methods. A mammal, 

20 (e.g. a mouse, hamster, or rabbit) can be immunized with an immunogenic form of the peptide 
which elicits an antibody response in the mammal. Techniques for conferring immunogenicity on 
a peptide include conjugation to carriers or other techniques well known in the art. For example, 
the peptide can be administered in the presence of adjuvant. The progress of immunization can 
be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other 

25 immunoassay can be used to assess the levels of antibodies. Following immunization, antisera 
can be obtained and, if desired, polyclonal antibodies isolated from the sera. 

To produce monoclonal antibodies, antibody producing cells (lymphocytes) can 
be harvested from an immunized animal and fused with myeloma cells by standard somatic cell 
fusion procedures, thus immortalizing these cells and yielding hybridoma cells. Such techniques 

30 are well known in the art. For example, the hybridoma technique originally developed by Kohler 
and Milstein (Kohler, 1975) as well as other techniques such as the human B-cell hybridoma 
technique (Kozbor, 1983), the EBV-hybridoma technique to produce human monoclonal 
antibodies (Cole, 1985), and screening of combinatorial antibody libraries (Huse, 1989). 
Hybridoma cells can be screened immunochemically for production of antibodies specifically 

35 reactive with the peptide, and monoclonal antibodies isolated. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with a protein having the biological activity of myostatin, or a peptide 
fragment thereof. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above for whole antibodies. For 
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example, F(ab% fragments can be generated by treating antibody with pepsin. The resulting 
Ffab^ fragment can be treated to reduce disulfide bridges to produce Fab' fragments. 

It is also known in the art to make chimeric antibody molecules with human 
constant regions. See, for example, Morrison et a/., Takeda et a/., Cabilly et a/., Boss et a/., 
5 Tanaguchi et a/., Teng et a/. (Morrison, 1985; Takeda, 1985; Cabilly; Boss; Tanaguchi; Teng, 
1982), European Patent Publication 0173494, United Kingdom Patent GB 2177096B, PCT 
Publication WO92/06193 and EP 0239400. It is expected that such chimeric antibodies would be 
less immunogenic in a human subject than the corresponding non-chimeric antibody. 

Another method of generating specific antibodies, or antibody fragments, 

1 0 reactive against protein having the biological activity of a myostatin protein, or a peptide fragment 
thereof, is to screen expression libraries encoding immunoglobulin genes, or portions thereof, 
expressed in bacteria, with peptides produced from the nucleic acid molecules of the present 
invention. For example, complete Fab fragments, VH regions and FV regions can be expressed 
in bacteria using phage expression libraries. See for example Ward et a/., Huse et a/., and 

15 McCafferty et a/. (Ward, 1989; Huse, 1989; McCafferty, 1990). Screening such libraries with, for 
example, a myostatin protein can identify immunoglobulin fragments reactive with myostatin. 
Alternatively, the SCID-hu mouse developed by Genpharm can be used to produce antibodies, 
or fragments thereof. 

The polyclonal, monoclonal or chimeric monoclonal antibodies can be used to 

20 detect the proteins of the invention, portions thereof or closely related isoforms in various 
biological materials, for example they can be used in an ELISA, radioimmunoassay or 
histochemical tests. Thus, the antibodies can be used to quantify the amount of a myostatin 
protein of the invention, portions thereof or closely related isoforms in a sample in order to 
determine the role of myostatin proteins in particular cellular events or pathological states. Using 

25 methods described hereinbefore, polyclonal, monoclonal antibodies, or chimeric monoclonal 
antibodies can be raised to nonconserved regions of myostatin and used to distinguish a 
particular myostatin from other proteins. 

The polyclonal or monoclonal antibodies can be coupled to a detectable 
substance or reporter system. The term "coupled" is used to mean that the detectable 

30 substance is physically linked to the antibody. Suitable detectable substances include various 
enzymes, prosthetic groups, fluorescent materials, luminescent materials and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
p-galactosidase, and acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 

35 umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 

fluorescein, dansyl chloride and phycoerythrin; an example of a luminescent material includes 
luminol; and examples of suitable radioactive material include 125 l; t31 l, 35 S and 3 H. In a preferred 
embodiment, the reporter system allows quantitation of the amount of protein (antigen) present. 
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Such an antibody-linked reporter system could be used in a method for 
determining whether a fluid or tissue sample of a subject contains a deficient amount or an 
excessive amount of the protein. Given a normal threshold concentration of such a protein for a 
given type of subject, test kits could thus be developed. 
5 The present invention allows the skilled artisan to prepare bispecific antibodies 

and tetrameric antibody complexes. Bispecific antibodies can be prepared by forming hybrid 
hybridomas (Staerz, 1986a &b). 

Compositions of the invention are administered to subjects in a biologically 
compatible form suitable for pharmaceutical administration in vivo. By "biologically compatible 

1 0 from suitable for administration in vivo" is meant a form of the composition to be administered in 
which any toxic effects are outweighed by the therapeutic effects of the composition. The term 
"subject" is intended to include living organisms in which a desired therapeutic response can be 
elicited, e.g. mammals. Examples of subjects include cattle, human, dogs, cats, mice, rats and 
transgenic species thereof. Administration of a therapeutically active amount of the therapeutic 

1 5 compositions of the present invention is defined as an amount effective, at dosages and for 
periods of time necessary to achieve the desired result. For example, a therapeutically active 
amount of a compound that inhibits the biological activity of myostatin protein may vary according 
to factors such as the age, sex, and weight of the individual, as well as target tissue and mode of 
delivery. Dosage regimes may be adjusted to provide the optimum therapeutic response. For 

20 example, several divided doses may be administered daily or the dose may be proportionally 
reduced as indicated by the exigencies of the therapeutic situation. 

As far as the United States is concerned, this application is a Continuation-in- 
Part Application of prior United States Patent Application Serial No. 08/891 ,789, filed July 14, 
1997, the specification of which is incorporated herein by reference. 

25 Those skilled in the art will know, or be able to ascertain using no more 

than routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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CLAIMS 

1 . A method of increasing muscle mass of a mammal having muscle cells in which myostatin is 
expressed, the method comprising administering to the mammal an effective amount of a nucleic 
acid molecule substantially complementary to at least a portion of mRNA encoding the myostatin 

5 and being of sufficient length to sufficiently reduce expression of the myostatin to increas the 
muscle mass. 

2. The method of claim 1 wherein the mammal is bovine. 

3. A method of increasing muscle mass of a mammal, the method comprising administering to 
the mammal an effective amount of a nucleic acid molecule having ribozyme activity and a 

1 0 nucleotide sequence substantially complementary to at least a portion of mRNA encoding 
myostatin and being of sufficient length to bind selectively thereto to sufficiently reduce 
expression of the myostatin so as to increase the muscle mass. 

4. The method of claim 3 wherein the mammal is bovine. 

5. A diagnostic kit, for determining the presence of muscular hyperplasia in a mammal from 
1 5 which a sample containing DNA of the mammal has been obtained, the kit comprising: 

first and second primers for amplifying the DNA, the primers being complementary to 
nucleotide sequences of the DNA upstream and down stream, respectively, of a 
mutation in the portion of the DNA encoding myostatin which results in muscular 
hyperplasia of the mammal, wherein at least one of the nucleotide sequences is 
20 selected to be from a non-coding region of the myostatin gene. 

6. The diagnostic kit of claim 5, further comprising a third primer complementary to a naturally 
occurring mutation of a coding portion of the myostatin gene. 

7. A diagnostic kit, for determining the genotype of a sample of mammalian genetic material, the 
kit comprising: 

25 a pair of primers for amplifying a portion of the genetic material corresponding to a 

nucleotide sequence which encodes at least a portion of a myostatin protein, 
wherein a first of the primers includes a nucleotide sequence sufficiently 
complementary to a mutation of SEQ ID NO:1 to prime amplification of a nucleic 
acid molecule containing the mutation, the mutation being selected from the group 

30 of mutations resulting from: (a) deletion of 1 1 nucleotides beginning at nucleotide 

821 of the coding portion of SEQ ID NO:1; (b) deletion of 7 nucleotides beginning at 
nucleotide 419 of the coding sequence and insertion of the sequence 
AAGCATACAA in place thereof; (c) deletion of nucleotide 204 of the coding 
sequence and insertion of T in place thereof; (d) deletion of nucleotide 226 of the 

35 coding sequence and insertion of T in place thereof; and (e) deletion of nucleotide 

313 of the coding sequence and insertion of A in place thereof; and combinations 
thereof. 
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8. The diagnostic kit of claim 7 wherein a second of the pair of primers is located ntir ly 
upstream or entirely downstream of the selected mutation or mutations. 

9. The diagnostic kit of claim 8 wherein a first said primer spans mutation (a) and further 
comprising a third primer which is sufficiently complementary to the nucleotide sequence 

5 identified as SEQ ID NO:11 to prime amplification of a nucleic acid molecule containing SEQ ID 
NO:1l. 

10. The diagnostic kit of claim 8 wherein a first said primer is sufficiently complementary to the 
inserted sequence of mutation (b) to prime amplification of a nucleic acid molecule containing 
mutation (b) and further comprising a third primer which is sufficiently complementary to the 

10 sequence corresponding to the 7 nucleotide deletion of mutation (b) to prime amplification of a 
nucleic acid molecule containing the 7 nucleotide deletion of mutation (b). 

1 1 . The diagnostic kit of claim 8 wherein a first said primer spans mutation (c) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (c) to prime amplification of a nucleic acid molecule 

15 lacking mutation (c). 

12. The diagnostic kit of claim 8 wherein a first said primer spans mutation (d) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (d) to prime amplification of a nucleic acid molecule 
lacking mutation (d). 

20 13, The diagnostic kit of claim 8 wherein a first said primer spans mutation (e) and further 
comprising a third primer which is sufficiently complementary to the sequence spanning the 
corresponding region lacking mutation (e) to prime amplification of a nucleic acid molecule 
lacking mutation (e). 

14. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
25 method comprising: 

obtaining a sample of material containing DNA from a said animal; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
wherein the absence of DNA having said nucleotide sequence indicates the presence of 
30 muscular hyperplasia in the animal. 

15. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the DNA in the 
presence of primers based on a nucleotide sequence encoding a protein having biological activity 
of myostatin. 

35 16. The method of claim 15 wherein DNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 
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17. Th method of claim 14, wherein ascertaining whether DNA having a nucleoid sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the DNA 
in the presence of primers based on a nucleotide sequence encoding the N-terminal and the C- 
terminal, respectively, of the protein having biological activity of myostatin. 
5 18. The method of claim 14, wherein ascertaining whether DNA having a nucleotide sequenc 
encoding a protein having biological activity of myostatin is present includes amplifying th DNA 
in the presence of first and second primers based on first and second nucleotide sequenc s 
encoding spaced apart regions of the protein, wherein said regions flank a mutation known to 
naturally occur and which when present in both alleles of a said animal results in said muscular 
10 hyperplasia. 

19. The method of claim 18 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 
sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 1 1- 

1 5 base pair deletion beginning at base pair no. 821 , and said first primer is selected to be upstream 
of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

20. The method of claim 14 wherein a DNA of said animal not displaying muscular hyperplasia 
contains a nucleotide sequence which hybridizes under stringent conditions with a nucleotide 

20 sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the coding 
sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 11- 
base pair deletion beginning at base pair no. 821 , and said primer is selected to span the 
nucleotide sequence including base pair nos. 820 and 821 of the DNA sequence containing said 
deletion. 

25 21 . The method of claim 19 wherein the animal is of a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

22. The method of claim 20 wherein the animal is a breed selected from Belgian Blue, 
Asturiana, Parthenaise and Rubia Gallega. 

23. The method of claim 14 wherein ascertaining whether DNA having a nucleotide sequence 
30 encoding a protein having biological activity of myostatin is present includes amplifying the DNA 

in the presence of a primer containing at least a portion of a mutation known to naturally occur 
and which when present in both alleles of a said animal results in said muscular hyperplasia. 

24. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 

35 obtaining a sample of material containing DNA from a said animal; and 

ascertaining whether DNA having a mutation as defined in claim 7 is present; and 
ascertaining whether DNA having a nucleotide sequence encoding a protein having 
biological activity of myostatin is present, 
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wherein the absence of DNA having said nucleotide sequenc and presence of a said mutation 
indicates the presence of muscular hyperplasia in the animal. 

25. A method for determining the presence of muscular hyperplasia in a bovine animal, the 
method comprising: 
5 obtaining a sample of the animal containing mRNA; and 

ascertaining whether an mRNA encoding a protein having biological activity of myostatin 
is present in the sample, 
wherein the absence of said mRNA indicates the presence of muscular hyperplasia in the 
animal. 

V 

1 0 26. The method of claim 25 wherein the sample is of muscle tissue or wherein the tissue is 
skeletal muscle tissue. 

27. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin includes amplifying the mRNA in the 
presence of primers substantially complementary to the nucleotide sequence encoding the 

15 protein. 

28. The method of claim 27 wherein mRNA of a said bovine animal not displaying muscular 
hyperplasia has a nucleotide sequence which is capable of hybridizing with a nucleic acid 
molecule having the sequence identified as SEQ ID NO:1 under stringent hybridization 
conditions. 

20 29. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of primers substantially complementary to a nucleotide sequence encoding the 
N-terminal and the C-terminal, respectively, of the protein having biological activity of myostatin. 
30. The method of claim 25, wherein ascertaining whether mRNA having a nucleotide sequence 

25 encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of first and second primers substantially complementary to first and second 
nucleotide sequences encoding spaced apart regions of the protein, wherein said regions flank a 
mutation known to naturally occur and which when present in both alleles of a said animal results 
in said muscular hyperplasia. 

30 31 . The method of claim 30 wherein an mRNA of said animal not displaying muscular 

hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 
nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and the 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
1 1 -base pair deletion beginning at base pair no. 821 , and said first primer is selected to be 

35 upstream of the codon encoding glutamic acid no. 275 and the second primer is selected to be 
downstream of the codon encoding aspartic acid no. 274. 

32. The method of claim 25 wherein ascertaining whether mRNA having a nucleotide sequence 
encoding a protein having biological activity of myostatin is present includes amplifying the mRNA 
in the presence of a primer containing a nucleotide sequence complementary to at least a 
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portion of a mutation known to naturally occur in a said animal and which when present in both 
alleles of a said animal results in said muscular hyperplasia. 

33. The method of claim 32 wherein an mRNA of said animal not displaying muscular 
hyperplasia contains a nucleotide sequence which hybridizes under stringent conditions with a 

5 nucleotide sequence encoding a protein having a sequence identified as SEQ ID NO:2 and th 
coding sequence of DNA of a said animal displaying muscular hyperplasia is known to contain an 
1 1 -base pair deletion beginning at base pair no. 821 , and said primer is selected to span the 
deleted portion. 

34. The method of claim 31 wherein the animal is of a breed selected from Belgian Blue, 
1 0 Asturiana, Parthenaise and Rubia Gallega. 

35. A method for determining the presence of muscular hyperplasia in a mammal, the method 
comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether a sequence of the DNA encoding (a) a protein having biological activity 
15 of myostatin, is present, and whether a sequence of the DNA encoding (b) an allelic 

protein lacking the activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 

hyperplasia in the mammal. 

36. The method of claim 35 wherein (b) contains a naturally occurring mutation responsible for 
20 the lack of activity. 

37. The method of claim 35 wherein the mammal is a human. 

38. The method of claim 37 wherein ascertaining whether a sequence of the DNA encoding (a) 
is present, and whether a sequence of the DNA encoding (b) is present includes amplifying the 
DNA 

25 in the presence of primers based on a nucleotide sequence encoding a protein having biological 
activity of myostatin. 

39. The method of claim 38 wherein said primers are based on the sequence identified as SEQ 
ID NO:7. 

40. A method for determining the presence of muscular hyperplasia in a mammal, the method 
30 comprising: 

obtaining a sample of material containing mRNA from the mammal; and 

ascertaining whether a sequence of the mRNA encoding (a) a protein having biological 

activity of myostatin, is present, and whether a sequence of the mRNA encoding 

(b) a protein at least partially encoded by a truncated nucleotide sequence 
35 corresponding to substantially the sequence of the mRNA and lacking the 

activity of (a), is present; 
wherein the absence of (a) and the presence of (b) indicates the presence of muscular 

hyperplasia in the mammal. 
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41 . The method of claim 40 wh rein th mRNA encoding (a) and the truncated sequence 
correspond to alleles of DNA of the mammal. 

42. The method of claim 40 wherein the mammal is human. 

43. The method of claim 42 wherein ascertaining whether a sequence of the mRNA encoding 

5 (a) is present, and whether a sequence of the mRNA encoding (b) is present includes amplifying 
the mRNA in the presence of a pair of primers complementary to a nucleotide sequence 
encoding a protein having biological activity of myostatin. 

44. The method of claim 43 wherein each said primer contains a truncated nucleotide sequence 
substantially complementary to a portion of the sequence identified as SEQ ID NO:7. 

1 0 45. The method of claim 44 wherein the truncated sequence contains at least 50 consecutive 
nucleotides substantially corresponding to about 10, or between about 10 and 20, or between 
about 20 and 30, or between about 30 and 40, or between about 40 and 50 consecutive 
nucleotides of SEQ ID NO:7. 

46. A method for determining the presence of muscular hyperplasia in a mammal, the method 
15 comprising: 

obtaining a tissue sample of containing mRNA of the mammal; and 
ascertaining whether an mRNA encoding a mutant type myostatin protein lacking 
biological activity of myostatin is present, 
wherein the presence of a said mRNA encoding a mutant type myostatin protein indicates the 
20 presence of muscular hyperplasia in the mammal. 

47. The method of claim 46 wherein the mutant type myostatin protein lacing biological activity is 
encoded by a naturally occurring allele of DNA encoding the mRNA. 

48. A method for determining the presence of double muscling in a bovine animal, the method 
comprising: 

25 obtaining a sample of material containing DNA from the animal; and 

ascertaining whether the DNA contains the nucleotide coding sequence identified as 
SEQ ID NO:11, 

wherein absence of the sequence indicates double muscling in the animal. 

49. The method of claim 34 wherein the animal is of a breed selected from Belgian Blue, 
30 Asturiana, Parthenaise and Rubia Gallega. 

50. A method for determining the myostatin genotype of a mammal, comprising: 

obtaining a sample of material containing nucleic acid of the mammal, wherein the nucleic 

acid is uncontaminated by heterologous nucleic acid; 
ascertaining whether the sample contains a (i) nucleic acid molecule encoding a protein 
35 having biological activity of myostatin; and 

ascertaining whether the sample contains an (ii) allelic nucleic acid molecule encoding a 
protein lacking biological activity of myostatin. 

51 . The method of claim 50 wherein the mammal is human and (i) comprises a nucleic acid 
sequence substantially homologous with the sequence identified as SEQ ID NO:7. 
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52. A purified protein having biological activity f myostatin, and having an amin acid s quence 
identified as SEQ ID N02, or a conservatively substituted variant thereof. 

53. An isolated nucleic acid molecule encoding a protein of claim 52. 

54. An isolated nucleic acid molecule comprising a DNA molecule having the nucleotide 

5 sequence identified as SEQ ID NO:1 or which varies from the sequence due to the degeneracy 
of the genetic code, or a nucleic add strand capable of hybridizing with at least one said nucleic 
acid molecule under stringent hybridization conditions. 

55. Isolated mRNA transcribed from DNA having a sequence which corresponds to a nucleic 
acid molecule according to claim 54. 

10 56. Isolated DNA having a sequence according to claim 54 in a recombinant cloning vector. 

57. A microbial cell containing and expressing heterologous DNA which is complementary a 
nucleic acid molecule of claim 54. 

58. A transfected cell line which expresses a protein of claim 52. 

59. A process for producing the protein of claim 52 comprising: 

1 5 preparing a DNA fragment including a nucleotide sequence which encodes said protein; 

incorporating the DNA fragment into an expression vector to obtain a recombinant DNA 
molecule which includes the DNA fragment and is capable of undergoing 
replication; 

transforming a host cell with said recombinant DNA molecule to produce a transformant 
20 which can express said protein; 

culturing the transformant to produce said protein; and 
recovering said protein from resulting cultured mixture. 

60. A method of increasing muscle mass in a mammal, comprising administering an effective 
amount of an antibody to myostatin to said mammal. 

25 61 . A method of increasing muscle mass in a mammal, comprising raising an autoantibody to 
the myostatin the in the mammal. 

62. The method of claim 61 wherein raising the autoantibody includes administering a protein 
having myostatin activity to the mammal. 

63. A method of increasing muscle mass in a mammal in need thereof, comprising 

30 administering to the mammal an effective amount of an antisense nucleic acid or oligonucleotide 
substantially complementary to at least a portion of the sequence identified as SEQ ID NO:1 or 
SEQ ID NO:5, or SEQ ID NO:7. 

64. The method of claim 63 wherein the portion is at least 5 nucleotide bases in length. 

65. The method of claim 64 wherein the mammal is a bovine and the sequence is the sequence 
35 identified as SEQ ID NO:1 . 

66. A method of increasing muscle mass in a mammal, comprising administering to the 
mammal an effective amount of an antibody to the myostatin. 

67. A probe comprising a nucleic acid molecule sufficiently complementary with a sequence 
identified as SEQ ID NO:1 , or its complement, so as to bind thereto under stringent conditions. 
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68. The probe of claim 67 wherein the sequence is between about 8 and about 1 1 95 
nucleotides in length, or between about 15 and 1195 nucleotides in length, or between about 25 
and 1195 nucleotides in length, or between about 35 and 1195 nucleotides in length, or between 
about 45 and 1195 nucleotides in length, or between about 55 and 1195 nucleotides in length, or 

5 between about 65 and 1 1 95 nucleotides in length, or between about 75 and 1 1 95 nucleotid s in 
length, or between about 75 and 1195 nucleotides in length, or between about 85 and 1 195 
nucleotides in length, or between about 95 and 1 195 nucleotides in length, or between about 105 
and 1195 nucleotides in length, or between about 115 and 1195 nucleotides in length. 

69. A method for identifying a nucleotide sequence of a mutant gene encoding a myostatin 
1 0 protein of a mammal displaying muscular hyperplasia, the method comprising: 

obtaining a sample of material containing DNA from the mammal; and 

probing the sample using a nucleic acid probe based on a nucleotide sequence of a 

known gene encoding myostatin in order to identify nucleotide sequence of the 

mutant gene. 

15 70. The method of claim 69, wherein the probe is based on a nucleotide sequence of a non- 
coding region of the gene. 

71 . The method of claim 70 wherein the probe is based on SEQ ID NO:54. 

72. The method of claim 71 wherein the probe is at least 8 nucleic acids in length. 

73. The method of claim 69, wherein the step of probing the sample includes exposing the DNA 
20 to the probe under hybridizing conditions and further comprising isolating hybridized nucleic acid 

molecules. 

74. The method of claim 73, further comprising the step of sequencing isolated DNA. 

75. The method of claim 69, wherein the mammal is a bovine mammal and the probe is based 
on a said nucleotide sequence identified as SEQ ID NO:1. 

25 76. The method of claim 74, further comprising the step of isolating and sequencing a cDNA or 
mRNA encoding the complete mutant myostatin protein. 

77. The method of claim 71 , further comprising the step of isolating and sequencing a functional 
wild type myostatin from a said mammal not displaying muscular hyperplasia. 

78. The method of claim 76, further comprising comparing the complete coding sequence of the 
30 complete mutant myostatin protein with, if the coding sequence for a functional wild type 

myostatin from a said mammal is previously known, (1) the known sequence, or if the coding 
sequence for a functional wild type myostatin from a said mammal is previously unknown, (2) the 
sequence determined according to claim 74 or claim 77, to determine the location of any 
mutation in the mutant gene. 
35 79. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing DNA from the mammal; and 
ascertaining whether the DNA contains a said mutation determined according to claim 
78. 
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80. A method for determining the myostatin genotype of a mammal, wherein wild type myostatin 
of the mammal is substantially that of claim 78, comprising: 

obtaining a sample of material containing mRNA from the mammal; and 
ascertaining whether the mRNA contains a said mutation determined according to claim 
5 78. 

81. A primer composition useful for the detection of a nucleotide sequence encoding a myostatin 
comprising a first nucleic acid molecule based on a nucleotide sequence located upstream of a 
said mutation determined according to claim 78 and a second nucleic acid molecule based on a 
nucleotide sequence located downstream of the mutation. 

10 82. A probe comprising a nucleic acid molecule based on a nucleotide sequence of claim 74 or 
claim 76 and spanning a said mutation determined according to claim 78. 
83. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
phenotype being conferred by a transgene contained in the somatic and germ cells of the 
mammal, the transgene encoding a myostatin protein having a dominant negative mutation. 

15 84. The transgenic mammal of claim 83 wherein the mammal is male and non-human and the 
transgene is located on the Y chromosome. 

85. The transgenic mammal of claim 83 wherein the mammal is bovine and the transgene is 
located to be under the control of a promoter which normally a promoter of a myosin gene. 

86. A transgenic mammal having a phenotype characterized by muscular hyperplasia, said 
20 phenotype being conferred by a transgene having a sequence antisense to that encoding a 

myostatin protein of the mammal. 

87. The transgenic mammal of claim 86 wherein the mammal is bovine and the transgene is 
located on the Y chromosome. 

88. The transgenic mammal of claim 86 wherein the transgene further comprises a sequence 
25 which when transcribed obtains an mRNA having ribozyme activity. 

89. A transgenic non-human mammal having a phenotype characterized by muscular 
hyperplasia, said phenotype being inducible and being conferred by a myostatin gene flanked by 
J oxP sides and a Cre transgene under the dependence of an inducible promoter. 

90. A transgenic non-human male mammal having a phenotype characterized by muscular 
30 hyperplasia , said phenotype being conferred by a myostatin gene flanked by J oxP sides and a 

Cre transgene located on the Y chromosome. 

91 . A method for determining whether a sample of mammalian genetic material is capable of a 
conferring a phenotype characterized by muscular hyperplasia, comprising ascertaining whether 
the genetic material contains a nucleotide sequence encoding a protein having biological activity 

35 of myostatin, wherein the absence of said sequence indicates the presence of muscular 
hyperplasia in the animal. 

92. A transgenic bovine having a genome lacking a gene encoding a protein having biological 
activity of myostatin. 
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93. A transgenic mouse having a genome containing a gene encoding a human protein having 
biological activity of myostatin or containing a gene encoding a bovine protein having biological 
activity of myostatin. 

94. A transgenic bovine having a gene encoding a bovine protein having biological activity of 
5 myostatin and heterologous nucleotide sequence antisense to the gene. 

95. A transgenic bovine of claim 94. further comprising a gene encoding a nucleic acid 
sequence having ribozyme activity and in transcriptional association with the nucleotide sequence 
antisense to the gene. 
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SEQUENCE ID NO. 1 



1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGATTTTA AAACCATGCA 
51 AAAACTGCAA ATCTCTGTTT ATATTTACCT ATTTATGCTG ATTGTTGCTG 
101 GCCCAGTGGA TCTGAATGAG AACAGCGAGC AGAAGGAAAA TGTGCAAAAA 
151 GAGGGGCTCT GTAATGCATG TTTGTGCAGG GAAAACACTA CATCCTCAAG 
201 ACTAGAAGCC ATAAAAATCC AAATCCTCAG TAAACITCGC CTGGAAACAG 
251 CTCCTAACAT CAGCAAAGAT GCTATCAGAC AACTTTTGCC CAAGGCTCCT 
301 CCAGTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAG ATCCCAGGAG 
351 TGACGGCTCC TTGGAAGACG ATGACTACCA CGCCAGCACG GAAACCGTCA 
401 TTACCATGCC CACGGAGTCT GATCTTCTAA CGCAAGTGGA AGCAAAACCC 
451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT 
501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCC TGTCAAGACT CCTGCGACAG 
551 TGTTTGTGCA AATCCTGACA CTCATCAAAC CCATGAAAGA CGGTACAAGG 
601 TATACTGGAA TCCGATCTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT 
651 TTCCCAGAGC ATTGATGTGA AGACACTGTT GCAGAACTGG CTCAAACAAC 
701 CTGAATCCAA CTTAGGCATT GAAATCAAAG C'l'lTAGATCA GAATGGCCAT 
751 GATCTTGCTG TAACCTTCCC AGAACCAGGA QAAGATGGAC TGACTCCTTT 
801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGAITTTG 
851 GGCTTGATTG TGATGAACAC TCCACAGAAT CTCGATGCTG TCGTTACCCT 
901 CTAACTGTGG ATTTTGAAGC TTTTGGATGG GATTGGATTA TTGCACCTAA 
951 AAGATATAAG GCCAATTACT GCTCTGGAGA ATGTGAATTT GTATTTTTGC 
1001 AAAAGTATCC TCATACCCAT CTTGTGCACC AAGCAAACCC CAGACGTTCA 
1051 CCCGGCCCCT GCTGTACTCC TACAAAGATG TCTCC AATTA ATATGCTATA 
1101 TTTTAATGGC GAAGGACAAA TAATATACGG GAAGATTCCA GCCA'I'CCTAG 
1151 TAGATCGCTG TGGGTGTTCA TGAGTCTATA TTtGGgTTCA TAAGC 
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SEQUENCE ID NO. 2 



1 fWQKLQISVY IYLPMLIVAG PVDLNENSEQ KKNVEKEGLC NACI.WRENTT 

51 SSRLEATKIQ ILSKI.RLETA PNTSKDAIRQ LLPKAFPLLE L1DQFDVQRD 

101 ASSDGSLEDD DYIIARTETVI TMPTESDLLT UVEGKPKCCP FEFSSKIQYN 

151 KT.VKAQLWIY LRPVKTPATV FVQILRLTKP WJOX5TRYTGI RSLKLDMNPG 

201 TGTWQSIDVK TVTjQNWLKQP ESNLGIETKA LDl£NQHDLAV TFPEPGEDGL 

2S1 TPFLEVKVTD TPKRSRRJDFC LDCDEHSTKS RCCF.YPLTVn FEAFCWDWTI 

3 01 APKRYKANYC SCECEFVFLQ KYPHTHIiVHQ ANPRGSAGPC CTPTKMSPIN 

351 WLYFNGECQI TYGKIPAMW DRCGCS* 'u^l UW i U 
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SEQUENCE ID NO. 3 



1 AGGAAGAATA AGAACAAGGG AAAAGATTGT ATTGATTTTA AAACCATGCA 
51 AAAACTCCAA ATCTCTGTTT ATATTTACCT ATTTATGCl 'G ATTGTTGCTG 
101 GCCCAGTCCA TCTGAATGAG AACAGCGAGC AGAAGGAAAA 'IfjTGGAAAAA 
15? CAGGGGCTGT GTAATGCATG TTTGTGGAGG GAAAACACTA CATCCTCAAG 
201 ACTACAAGCC ATAAAAATCC AAATCCTCAG TAAACTTCGC CTGGAAACAG 

251 CTCCTAACAT GAGCAAAGAT GCTATCAGAC AACTTTTCCC CAAGGCTCCT 

301 CCACTCCTGG AACTGATTGA TCAGTTCGAT GTCCAGAGAC ATGCCAGCAG 

351 TGACGGCTCC TTGGAAGACG ATGACTACCA CGCCAGGAGG CAAACGGTCA 

401 TTACCATGCC CAGGGAGTCT GATCTTCTAA CGCAAGTGGA AGGAAAACCC 

451 AAATGTTGCT TCTTTAAATT TAGCTCTAAG ATACAATACA ATAAACTAGT 

501 AAAGGCCCAA CTGTGGATAT ATCTGAGGCG TGTCAAGACT CCTGCGACAG 

551 TGTTTGTGCA AATCCTCACA CTCATCAAAC CCATGAAAGA CGGTACAAGO 

601 TATACTGGAA TCCGATGTCT GAAACTTGAC ATGAACCCAG GCACTGGTAT 

651 TTGGCAGAGC ATTGATGTGA AGACAGTGTT GCAGAACTGG CTCAAACAAC 

701 CTGAATCCAA CTTAGGCATT GAAATCAAAG GTTTAGATGA GAATGGCCAT 

751 GATCTTGCTG TAACCTTCCC AGAACCAGGA GAAGATGGAC TGACTCCTTT 

801 TTTAGAAGTC AAGGTAACAG ACACACCAAA AAGATCTAGG AGAGATTTTG 

851 GGCTTGATTG TGACAGAATC TCGATGCTGT CGTTACCCTC TAACTGTGGA 

901 TTTTGAAGCT TTTGGATGGG ATTGGATTAT TCCACCTAAA AGATATAAGG 

951 CCAATTACTG CTCTGGAGAA TGTGAATTTG TATTTTTGCA AAAGTATCCT 

1001 CATACCCATC TTGTGCACCA AGCAAACCCC AGAGGTTCAG CCGGCCCCTG 

1051 CTGTACTCCT ACAAAGATGT CTCCAATTAA TATGCTATAT TTTAATGGCG 

1101 AAGGACAAAT AATATACGGG AACATTCCAG CCATGGTAGT AAATCGCTGT 

1151 GGGTGTTCAT GAGGTCTATA TTTGGTTCAT AGCTTCCTCA AACATGGAAG 

1201 GTCTTCCCCT CAACAATTTT GAAACTGTTG AAATTATGT 
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SEQUENCE ID NO. 4 



1 tMQKLQISVY IYLFMLIVAG PVDLNENSEQ KENVEKEGLC NACLWKKNTT 

51 SSRDEAIKIQ ILSKLRLETA PNISKDAIRQ I.LPKAPFLLE LlbyFDVQRD 

101 ASSDGSLEDD DYfiARTETVI TMPTESDLLT QVEGKPKCCF PKPSfcJKIQYN 

151 KI1VKAQI1WIY T.RPVKTPATV FVQIIiRLIKP MKDGTKYTGI RST.Kl.lJMNPG 

201 TGIWQSIDVK TVLQNWLKQP ESNLGTEIKA LDKNGHDLAV TFPEPG&1X5.L 

251 TPFLEVKVTD TPKR.SRRDFG TjDCDRISMLS LPSNCGF 
301 

351 
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SEQUENCE ID NO. 5 



1 


GTCTCTCGGA CGGTACATGC 


ACTAATATTT 


CACTTGGCAT TACTCAAAAG 


SI 


CAAAAAGAAG 


AAATAAGAAC 


AACCGAAAAA 


AAAAGATTGT GCTGATTTTT 


101 


AAAATGATGC 


AAAAACTGCA AATGTATGTT 


TATATTTACC TCTTCATOCT 


151 


GATTGCTGCT 


GGCCCAGTGG 


ATCTAAATGA 


GGGCAGTGAG AGAGAAGAAA 


201 


ATGTGGAAAA 


AGAGGGGCTG 


TGTAATGCAT 


GTGCGTGGAG ACAAAACACG 


251 


AGGTACTCCA 


GAATAGAACC 


CATAAAAATT 


CAAATCCTCA GTAAGCTGCO 


301 


CCTGGAAACA 


GCTCCTAACA 


TCAGCAAAGA 


TGCTATAAGA CAACTTCTGC 




CAAGACCGCC 


TCCACTCCGG 


GAACTGATCG 


ATCAGTACGA CGTCCACAGG 


401 


GATGACACCA 


GTGATGGCTC 


TTTGGAAGAT 


GACGATTATC ACGCTACCAC 


451 


GGAAACAATC 


ATTACCATGC 


CTACACAGTC 


TGACTTTCTA ATGCAACJCGG 


501 


ATGGCAAGCC 


CAAATGTTGC 

* 


TTTTTTAAAT 


TTAGCTCTAA AATACAGTAC 


551 


AACAAAGTAG 


TAAAAGCCCA ACTGTGGATA 


TATCTCAGAC CCGTCAAGAC 


601 


TCCTACAACA 


GTGTTTCTGC 


AAATCCTOAG 


ACTCATCAAA CCCATGAAAG 


651 


ACCCTACAAG 


GTATACTGGA 


ATCCGATCTC 


TGAAACTTGA CATGAGCJCCA 


701 


GGCACTGGTA 


TTTGGCAGAG 


TATOGATGTG 


AAGACAGTGT TGCAAAATTG 


751 


GCTCAAACAG 


CCTGAATCCA 


ACTTAGGCAT 


TGAAATCAAA GCTTTGGATG 


801 


AGAATGGCCA 


TGATCTTCCT 


GTAACCTTCC 


CAGGACCAGG AGAAGATGGG 


851 


CTGAATCCCT 


TTTTAGAACT 


CAAGGTGACA 


GACACACCCA AGAGGTCCCG 


901 


GAG AG AC TTT 


GGGCTTGACT 


GCGATGAGCA 


CTCCACGGAA TCCCGGTGCT 


951 


GCCGCTACCC 


CCTCACGGTC 


GATTTTGAAG 


CCTTTGGATG GOACTGGATT 


1001 


ATCGCACCCA 


AAAGATATAA 


GGCCAATTAC 


TGCTCAGGAG AGTGTGAATT 


1051 


TGTGTTTTTA 


CAAAAATATC 


COCATACTCA 


TCTTGTGCAC CAAGCAAACC 


1101 


CCAGAGGCTC 


AGCAGGCCCT 


TGCTGCACTC 


CGACAAAAAT GTCTCCCATT 


1151 


/-aTATGCTAT 


ATTTTAATGG 


CAAAGAACAA ATAATATATG GCJAAAATTCC 


1201 


AGCCATGGTA 


GTAGACCGCT 


CTCGGTGCTC 


ATGAGCTTTG CATTAGGTTA 


1251 


GAAACTTCCC 


AACTCATGGA 


AGGTCTTCCC 


CTCAATTTCG AAACTGTGAA 
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1301 TTCAAGCACC ACAGGCTGTA GGCCTTGAGT ATGCTCTACT AACGTAAGCA 
1351 CAAGCTACAG TGTATGAACT AAAAGAGAGA ATAGATGCAA TGGTTGGCAT 
1A01 TCAACCACCA AAATAAACCA TACTATAGGA TGTTGTATGA TTTCCAGAGT 
1451 TTTTGAAATA GATGGAGATC AAATTACATT TATGTCCATA TATOTATATT 
1501 ACAACTACAA TCTAGCCAAG GAAGTGAGAG CACATCTTGT GGTCTGCTOA 
1531 GTTAGGAGGG TATGATTAAA AGGTAAAGTC TTATTTCCTA ACAGTTTCAC 
1601 TTAATATTTA CAGAACAATC TATATGTAGC CTTTGTAAAG TGTAGGATTG 
1651 TTATCATTTA AAAACATCAT GTACACTTAT ATTTGTATTG TATACTTGGT 
1701 AAGATAAAAT TCCACAAAGT AGGAATGGGG CCTCACATAC ACATTGCCAT 
1751 TCCTATTATA ATTGGACAAT CCACCACGGT GCTAATGCAG TGCTCAATGG 
1801 CTCCTACTGG ACCTCTCGAT AGAACACTCT ACAAAGTACG AGTCTCTCTC 
1851 TCCCTTCCAG GTGCATCTCC ACACACACAG CACTAAGTGT TCAATGCATT 
1901 TTCTTTAAGG AAAGAAGAAT CTTTTTTTCT AGAGGTCAAC TTTCAGTCAA 
1951 CTCTAGCACA GCGGGAQTGA CTGCTGCATC XTAAAAGGCA GCCAAACAGT 
2001 ATTCATTTTT TAATCTAAAT TTCAAAATCA CTGTCTGCCT TTATCACATG 
2051 CCAATTTTGT GGTAAAATAA TGGAAATGAC TGGTTCTATC AATATTGTAT 
2101 AAAAGACTCT CAAACAATTA CATTTATATA ATATGTATAC AATATTGTTT 
2151 TGTAAATAAG TGTCTCCTTT TATATTTACT TTGGTATATT TTTACACTAA 
2201 TGAAATTTCA AATCATTAAA GTACAAAGAC ATGTCATGTA TCACAAAAAA 
2251 GGTGACTGCT TCTATTTCAG AGTGAATTAG CAGATTCAAT AGTGGTCTTA 
2301 AAACTCTGTA TGTTAAGATT AGAAGGTTAT ATTACAATCA ATTTATGTAT 
2351 TTTTTACATT ATCAACTTAT GGTTTCATGG TGGCTGTATC TATGAATGTG 
2401 GCTCCCAG7C AAATTTCAAT GCCCCACCAT TTTAAAAATT ACAAGCATTA 
2451 CTAAACATAC CAACATGTAT CTAAAGAAAT ACAAATATGG TATCTCAATA 
2501 ACAGCTACTT TTTTATTTTA TAATTTGACA ATGAATACAT TTCTTTTATT 
2551 TACTTCAGTT TTATAAATTG GAACTTTGTT TATCAAATGT ATTGTACTCA 
2601 TAGCTAAATG AAATTATTTC TTACATAAAA ATGTGTAGAA ACTATAAATT 
2651 AAAGTGTTTT CACATTTTTG AAACCC 
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SEQUENCE ID NO. 6 



1 MMQKLQMYVY IYLFMLXAAG PVDLNEGSER EENVKKECLC NACAWRQNTR 

51 YSRIEAIKIQ ILSKLRLETA FNISKDAIRQ LLPRAPPLRE LIDQYDVQRD 

101 DSSDGSTiEDD DYHATTETI I TMPTESDFLM QADGKPKCCF FKFSSKIQYN 

151 KWKAQLWIY LRPVKTPTTV FVQILRLIKP MKDGTRYTGT RST.KT,73M5PG 

201 TGIWQSIDVK TVLQNWLKQP ESNLGXEXKA LDENGHDLAV TFPGPGEDGL 

251 NPFLEVKVTD T PKR S RRDFG LDCDEHSTES RCCRYPLTVD FEAFGWDWII 

3 01 APKRYKANYC SGECE3TVFL.Q KYPHTHLVHQ ANPRGSAGPC CTPTKMSPIN 

351 MLYFNGKEOI IYGKIPAMW DRCGCS* 
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SEQUENCE ID NO. 7 



ha 3 7 5 3 tf GATTTTCTAATOr AACTOGATCCAAAACCC 

he37 53 AAATCTTCCITCTTTAAAtrrrAOCTCTAAAATACAATACAATAAA G TA GTAAA GG CCCAA 

h«375 3 CTATCCATATATTTGAGACCCGTCOACLACTCCTA^ 



hc37 5 3 CTGATCAAACCTATOAAAaACGGTACAACCXATACTGGAATCCGATCTCTGAAACTTGAC 
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h*37S3 ATCAACC€ACGC&CTG(]TATTTCr»CAaA -MKTTCUlTtftaiUlGliClkCTOTTGCAAAATT^ 



h*3753 CTrCAAACAACCTOAAXCCAACTTAOCXy^^ 



h*37S3 GATCTTOCTOTAACCTTCCCACGACCA GGAAGAAGATGCfOCTCM ATCCCTTTTTTAAOAA 



h«3 7 5 3 OCTCAAGQTAACAGACACAraiXAAAAQATTCCAUAAQCCUV T TTrCCGTCTTGACTqOTGA 



hr 37 5 3 TGAGCA-CTCAACAOAATCACGATCCTCTrCOTT 



ho 37 5 3 TTCCCATCOCATTOOA-TATCa 



ho783 3 AOCOATQOTAOTAQACCCJCTafTOOaTaftTC! 



h«7«23 ATOAGATTTATATTAAC CCTTCAT AACTTCCTAAAACATGGAAGGTTTTCCCCTCAACAA 
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h»7823 TTTTQAAaCTOTCAAAT^ 



h57823 AAOCATAAOCTAGAGTATGTAAACTAAAAGG^^ 



hA702J TTTAACCATCCAAACAAATCATAC - • CAGAAAGTTTTATCiATTTGCANAGTTTTTTNAGG 



h* 7 6 2 3 Cl^OAAAOQAOGAOTCAAANTTTCANITCTTATaaT 



2 02 7 ATTTCGGCACAGOTNAAACACTT GAA TTTATATTGTATGGTA GTATA 



h9202 7 CTTGGTAAGATAAAATTCGACAAAAATAGOOATGGTGCAGCATATaCA - ATTTCCATTCC 



h9 2027 TATTATAATTC^CACAOTACATTAACAATCCATC^CAACGCTCCTAATACGATAGOCTGA 
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n9 5 22 7 TAAATCTCAAC<TrrCCATTATTTTAATACT^^ 



a? 5 32 7 ATTOACTCTATTAXC - TO - AAATOAAO - AATAAACT0ATOC1ATCTCAACAATAACTOTT 



n9 5 3 2 7 A CTTTTATTTTATAATTTOATAATC^ 



n9 5 3 2 7 AATTOCGATTTTGTTAATCAAATTTATTCTACT * ATGACTAAATGAAATTATTTCTTACA 



ri95327 



T - CTAAlTITCrrAQAAACAC^ATAAQTTATAITAAACTOTTTTCACATT TO ^TTTQAAAQAC 
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SEQUENCE ID NO. 8 



1 


MQKLQLCVYI 


YLPMLIVAGP 


VDLNENSEQK 


ENVEKEGLCN ACTWRQNTKS 


51 


SRIEAIKIQI 


LSKLRLETAP NISKDVIRQL 


LPKAPPLREL 


IDQYDVQRDD 


101 


SSDGSLEDDD 


YHATTETI IT 


MPTESDFLMQ 


VDGKPKCCFF 


KFSSKIQYNK 


151 


WKAQLWIYL 


RPVETPTTVF 


VQILRLIKPM 


KDGTRYTGIR 


SLKLDMNPGT 


201 


GIWQSIDVKT 


VLQNWLKQPE 


SNLGIEIKAL 


DENGHDLAVT 


FPGPGEDGLN 


251 


PFLEVKVTDT 


PKRSRRDFGL 


DCDEHSTESR 


CCRYPLTVDF 


EAFGWEWIIA 


301 


PKRYKANYCS 


GECEFVFLQK 


YPHTHLVHQA NPRGSAGPCC 


TPTKMSPINM 


351 


LYFNGKEQII 


YGKIPAMWD 


RCGCS* 
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SEQUENCE ID NO. 54 

1 GCGGCCGCCC GGGCAGGTAT CGAAAGTTTC ACATATAAAG ATGAATAAGA 

51 TCTAAGTGTA TATGTTATTG TTAATAAAGT TTTTAATTTT TCGAATGTCA 

101 CATACAGCCT TTATTATTCA TAGATTTATT CCTTTTAAGA AGTAGT CAAA 

151 TGAATCAGCT CACCCTTGAC TGTAACAAAA TACTGTTTGG TGACTTGTGA 

201 CAGACAGGGT TTTAACCTCT GACAGCGAGA TTCATTGTGG AGCAAGAGCC 

251 AAT C AC AG AT CCCGACGACA CTTGTCTCATCAAAGTTGGA ATATAAAAAG 

301 CCACTTGGAA TACAGTATAAAACATTCACT GGTGTGGCAA GTTGTCTCTA 

351 GACTGGGCAG GCATTAACGT TTGGCTTGGC GTTACTCAAA AG C AAAAGAA 

4 01 AAG T AAAAG G AAG AAG T AAG AAC AAG G G AA AAGATTGTAT TGATTTTAAA 

4 51 AC C AT£C AAA AACTGCAAAT CTCTGTTTAT ATTTACCTAT TTATGCTGAT 

501 TGTTGCTGGC CCAGTGGATC TGAATGAGAA. CAGCGAGCAG AAGGAAAATG 

551 TGGAAAAAGA GGGGCTGTGT AATGCATGTT TGTGGAGGGA AAAC AC T AC A 

601 TCCTCAAGAC TAGAAGCCAT AAAAATCCAA ATCCTCAGTA AACTTCGCCT 

651 GGAAACAGCT CCTAACATCA GCAAAGATGC TATCAGACAA CTTTTGCCCA 

7 01 AGGCTCCTCC ACTCCTGGAA CTGATTGATC AGTTCGATGT C C AG AG AG AT 
751 GCCAGCAGTG ACGGCTCCTT GG AAG AC GAT GACTACCACG CCAGGACGGA 

8 01 AACGGTCATT ACCATGCCCA CGGAGT/GTGA GTAGTCCTGCTGGTGCAAAG 
851 CAACGACTCT GCTGACTGCT GTTCTAGTGT TCATGAAAAA CCGATCTATT 
901 TTCAGGCTCT TTTAACAAGC TGCTGGCTTG TATGTAAGGA GGAGGGGAAA 
951 GAGCTTTTTT CAAGATTTCA TGAGAAATAG ACCAATGAGA CTGAAAGCTG 

1001 CTACTTTATT TGTTTCCTTA GAG AG C T AAA. AAGCTAAAAA TCAAAAATGA 

■ 

1051 AATGCTTGCA TAGCATTCAT GTTATATAGT T TAG TAT G AC AACTATAACA 

1101 TGTTTATGTT TTCACAGCTT AATGCTACCA AGGTAAAGGA TTGGGAAACA 

1151 GTATCAGCAA TGTGAAAAAT T T AC AT CAAA TTTCCTAATT GCATTTGGTT 
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1201 GCCTGAAATA TGCATTTATA ATAACAGGTT TT TTTTTTTT CATTAATAAA 

1251 AGAGAAAGGA AGAAATCTGT AGAGGTTGAA GCCTATCTGG GCATTTGCTG 

1301 AACACTTAGA ATGACTTCTG TTATTCAAAA CTATTTCTCA TAGGGTTTTT 

1351 ATGGTCTTCA CAGAGTATCT AATTTTGAAA GCTATTAGAG TGGAAAGGAT 

1401 AAAAGAATAT TCTTAATAAA CTTAATGTAT TAGTAAGAGC AATAAGGAAG 

1451 TAAACACAGC ATAGTGAAAA ATCATGAGCT AATCAGCAGA AAATTCTAAG 

1501 AAATAAACAT TTTAATTACA AAGTTCCACT TATACCCTGA CCATGGTACT 

1551 ATTGTTGAGA GTACCTTGTC TGCACATATC TAGGAGGCAC ATGCTTAATA 

1601 ACCTTCTAAA ATATTATTGT ATTCCTCATA GGAGGGAGAA CTATTACCTA 

1651 TAT G TAG T AC CTATGTTGTT TCTGAAAGAT AATATGTTTC ATGTATTTCT 

1701 GTTGCAGTCA CTTCAAACCT ATACTCAAGG AAAGGGAGAC AGG CAT C T CA 

1751 ACAGAGAAGG CATGACCAGA AAGAGTTTTG TGCCATGTGT CTGCGATCTT 

1801 GCTTTATACA GGGCTCTACC CACTTTAAAC TGGACTCAAA ACAGTTTCAA 

1851 AATACTGCTT TTTCTTATTA AGTAACTAGT TTATAAGGCA ACAAATAAAT 

1901 TTCCTTTAAG ACTGTGCTAT CAGATAATCC TGGAATAGAT TTGCCTTACT 

1951 TATAAACAAT CTTGAGAAAA CAAAAAGGCA AG AAAT T G C T AAGTGCTTCT 

2001 GCTTACAATG ACAGCCTGGC CCTAAAGACA ATGTTTTCTA AGTTTTGAAA 

2051 CAGCTTGAAT ACAACATCTA AGTTTTGGTG CTAATTACCT GCTAGTTTTT 

2101 TTATTTTTTT CCTTTAAAAG GCTGTCCCAG CGTCCTAACA TAACAGATGC 

2151 ACTATATTTT CTGCTAATTC CCGAGGCTCA GTTAGTTGCT CACTGTGTCT 

2201 TGTCCCCAGG TAATTCAGGC CTGGGGGAAG GGTTCCTTCC TCCAGACTGA 

2251 TTGGTACAGC TGCTCAGTAA GTGTAACTAC TCAGATTCCC AAAGAATTCT 

2301 AAGTGGATGT TCTTCCACAG TGTCTCTTGT TCTCTCTAAT CATCATCATT 

2351 TTAAAATTTC ATCCACTTTT CATTCCTTAA TAGAATTTTC CTTAGTCCAC 

2401 AGTTCTCTGG AAAGGAAGTA GGCTTCTCAT AACAGCTGAA AAAACATATA 

24 51 CCTAAAAGAT TCTGAAAAGC TGTAATAACT GTTATACTTG ATATTTTGCT 
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2501 GTTATGAATG AAATGCTACA TATTTTTCCA TTTTAAAAGA CTAAATATGC 

2551 ACACATT ATTCCAATT AAAA AATGTTCATA GATTGATATGGAGGTGTTCG 

2601 TTCATTTTTC ATAAAAATGA TCTTAGTAAC TTTTTTTCTT ATTCATTTAT 

2651 AG/CTGATCTT CTAACGCAAGTGGAAGGAAA ACCCAAATGT TGCTTCTTTA 

2701 AATTTAGCTC TAAGATACAA TACAATAAAC TAGTAAAGGC CCAACTGTGG 

2751 AT AT AT C T G A GGCCTGTCAA GACTCCTGCG ACAGTGTTTG TGCAAATCCT 

2801 GAGACTCATC AAACCCATGA AAGACGGTAC AAGGTATACT GGAATCCGAT 

2851 CTCTGAAACT TGACATGAAC CCAGGCACTG GTATTTGGCA GAGCATTGAT 

2901 GTGAAGACAG TGTTGCAGAA CTGGCTCAAA CAACCTGAAT CCAACTTAGG 

2 951 CATTGAAATC AAAGCTTTAG ATGAGAATGG CCATGATCTT GCTGTAACCT 
3001 TCCCAGAACC AGGAGAAGAT GGACTG/GTAA GTGATTACTGAAAATAACAT 
3051 GCTAAAAACC TTGTTATGTG TTTATTCATA ATGTGAATGA ATAGTAGTGA 
3101 AAAATAACTA CCAGTTTCCT GTGCTTATAA GCCAGACAAA GGCACCTTAC 
3151 CCCAGTGGTA GCCCTGTACT CAATAAAAGT AGGTGTCCCA TTTCACATCC 
3201 TATGAAACAC TCTCTTGATA CTTTGACTTT GCATGAGGAT TTAAAAGAAA 
3251 AAAAGTTATA CCATGGTCCT TAAGTTTTTA GGGAATTCTT TGGAATTGAG 
3301 AATGAAATAT AAAATGCTTT CCGTTGATGT GCTACATGAT TATATAAATA 
3351 AAAACATGAA GTCTTCACAG TGGATTCTAG TACTCACCCA ACAACACATT 
34 01 TTTTCCCCCA GAAGAGTGAC CAATTTGTTA AAATTCTTTT GCTTAATAAG 
34 51 GCAGAAAAAT GAACTCTACA AGTTATAATT AAAATAAAAT GCTTTTACTT 
3501 ATAGAAATTA ACTAGATATA TGTTCAGGTT TATATACTAT T AAAT AT AC T 
3551 ATATTTAAGA TCTCTCATGA TAAATATGTT CCTTGTTTTA TAGACTATTG 
3601 ATGCACTGAT GTATATGTGG ATTACTTTGT GAATTACCCC TGGTAAAATT 

3 651 AAAAATTTCA GGCTAGTTAA CTTGTACTAC TTAGCTATTT TCTGAACTGT 
3701 CTTACTGTTC TTTAACAGGA GTTAACTTAG GTAATGTCAA CTAATTTAAT 
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3751 ATAAAGTCAA ACAGAAAATA ATGCCTTATA TATTATAAAA ATTAATAAAA 

3801 AACCATTTTA AAATCTAGTA TAAGTTTAGA GCTACTCACT CTTCTGGCTT 

3851 ATCTATGCTT GTATTTACTT CTGTTTTCAA AAAATTTTTT AATGTGACCA 

3901 TACCTTTTAT TTCCAGTTAT TGATATAATT TACAACAAAA GATTATACTT 

3951 GCAAGCTTTA TAGTTTTTAA ATGGTCTTAT TTGTAGTGAA TATCATATCT 

4001 AAATGATATC TAAATGTAAA GTAAATCATA CCTAAATGAA AACATATTCT 

4051 TTAAGTCATT ATAAAATTTT CCAGGTGATC AATTTTTCTT TAAAT AT AC T 

4101 AC AT AAAAT G TTATTGACTC CCAAAATGAT GTTATTTTGT ATAATCT TAA 

4151 ATACCAATAA TTACCAGGTC TATTTTGGTT TTAGTGTAGG ATAAAAAAGA 

4201 ATGTGTTCTT TTTTCTAGGT AGCATTTTAA TGATCAAAGT TGGTGACGTG 

4251 ACAGAGGTCT TAAG TAT TAT TAAACAGATG ATTAATAAGA TGTATTCCTC 

4301 AGACTTTTCC ATATAAAAGG AAAAATGTCT CAAATTCATG AAAAGATTGG 

4 351 TACAGGAGGA GGATTAGCAA ATTGTAGTTT AAATATCTGA ATGGAAACAC 

4 401 TTTTTAGTGA AAGAATAAAG GGAATATCAT TGTATCTTCT TCTGAGTCTG 

4 4 51 TGCCTCTCTC TCTTGGAGTT AGTCTTTCCA ACCCTATATA CTTACCACTA 

4 501 TCTTCATCCC TCTACCTTCC TTTTTCCCAT TACATCTGTG CAGTACTGGG 

4 551 TGGCAACTAT TGTGTTTCGG TGTTAATATC CAAGTTTCCC TGAATAAGAC 

4 601 CAAGTGAATG GAGGATGAAT GAGTATACCT ATCCCTCCAG GGGTCATCAG 

4 651 ACATATTTAG CC AC CAT ATT TAATCAATAA GCAGGAAGAC ATAAGCTAGC 

4701 CTTGTCCTTC TTCTTTCCTC CCTGCTCCTT TCTCTTCTCT TCCCCCTCTC 

4751 CCTTTACTGT CATCCATCAG TATTTTCAGA GCATCTATTA TGTGTCAGGC 

4 801 ATTCAGATAC TCAAACGGAG GAAAACAAGA ATAAACAAGA CAAAGATCTG 

4 851 ACCACAGGGG AATCCCTATG GCTACTGTAG ACTTTTGAGC CATAAAGGAA 

4901 GAATCAAGCC TAGTGTAAAT GAAAATTCCT TAATGCTGTG CCTT T TAAAA 

4 951 AGAAATGTGA CATAAGCAAA ATGATTAGTTTCTTT£TjrjTA_ATAATGAGTC 

5001 CTTGAGGTAG GAGAGTGTTT TGGGATCTAT TATTAAC TCT TCTTTCCTTT 
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5051 CCATACAG/AC TCCTTTTTTA GAAGTCAAGG TAACAGACAC AC C AAAAAG A 

5101 TCTAGGAGAG ATTTTGGGCT TGATTGTGAT GAACACTCCA CAGAATCTCG 

5151 ATGCTGTCGT TACCCTCTAA CTGTGGATTT TGAAGCTTTT GGATGGGATT 

5201 GGATTATTGC ACCTAAAAGA TATAAGGCCA ATTACTGCTC TGGAGAATGT 

5251 GAATTTGTAT TTTTGCAAAA GTATCCTCAT ACCCATCTTG TGCACCAAGC 

5301 AAACCCCAGA GGTTCAGCCG GCCCCTGCTG TACTCCTACA AAGATGTCTC 

5351 CAATTAATAT GCTATATTTT AATGGCGAAG GACAAATAAT ATACGGGAAG 

5401 ATTCCAGCCA TGGTAGTAGA TCGCTGTGGGTGTTCATSAG GTCTATATTT 

5451 GGTTCATAGC TTCCTCAAAC ATGGAAGGTC TTCCCCTCAA CAATTTTGAA 

5501 ACTGTGAAAT TATGTACCAC AGGCTATAAG C C TAG AG TAT GCTACAGTCA 

5551 CTTAAGCACA AGCTACAGTA TATGAGCTAA AAAGAGAGAA TATATGCAAT 

5601 GGTTGGCATT TAACCATCCA AACAAATCGT ATAATAAAAA GTTTTATGAT 

5651 TTCCAGAGTT TTTGAACTAG GAG AT C AAAT TCCATTTATG TTGAAATATA 

5701 TTACAACACA TGCAGGTGAA TGAAAGCAAT TCTCCTTGTC TTCTGGTGAA 

5751 TTAAAGGAGT ATGCTTTAAA ATCTATTTCT CTACAGTTTC 



i 



INTERNATIONAL SEARCH REPORT . „ . „ 

Aernatlonal Application No 

PCT/IB 98/01197 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/00 C12N15/12 C07K14/495 C12N5/10 C12Q1/68 
A01K67/027 A61K48/00 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C12N A01K C12P C12Q C07K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



MCPHERRON, LAWLER AND LEE: "Regulation of 
skeletal muscle mass in mice by a new 
TGF-beta superfamily member." 
NATURE, 

vol. 387, 1 May 1997, pages 83-90, 
XP002085797 

cited in the application 
see the whole document 



1.3, 

5-13, 

35-47, 

50-66, 

69-74 , 

76-84 , 

86,88-91 

2,4, 

14-34, 

48,49, 

67,68, 

75,85, 

87,92-95 



-/~ 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



° Special categories of cited documents : 

"A" document defining the general state of the art which is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&" document member of the same patent family 



Date of the actual completion of the international search 



3 December 1998 



Date of mailing of the international search report 



21/12/1998 



Name and mailing address of the ISA 

European Patent Office. P.B. 581 8 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl, 
Fax: (+31 -70) 340-3016 



Authorized officer 



Chambonnet, F 



Form PCTASA/210 (second sheet) (Juty 1992) 



RNATIONAL SEARCH REPORT 




iternational Application No 

PCT/IB 98/01197 



| C.(Contlnuation) DOCUMENTS CONSIDERED TO BE RELEVANT | 


1 Category ° 


Citation ot document, with indication.where appropriate, of the relevant passages 1 


Relevant to claim No. | 


1 Y 


CHARLIER ET AL.: "The mh gene causing 
doubl e-muscl i na in cattle maDs to the 
bovine chromosome 2." 
MAMMALIAN GENOME, 

vol. 6, no. 11, 1995, pages 788-792, 
XP002085798 

cited in the aDDli cation 
see the whole document 


2,4, 

14-34 , 

48,49, 

67,68, 

75,85, 

87,92-95 


X 


WO 94 21681 A (UNIV JOHNS HOPKINS MED ;LEE 
SE JIN (US); MCPHERRON ALEXANDRA C (US) 
29 September 1994 
see SEQ ID 14 


52-59, 
67,68, 
81,82,93 

D I 


P,X 


GROBET ET AL.: "Molecular definition of 

an allelic ^pk*ip<; n*f mtj1"A'Mnn<i Hi ^punti nn 1 

tail O 1 1 C 1 IV Owl ICO VII IMU L 1 wild \J 1 O 1 U yj L 1 1 1 y 1 

the myostatin function and causing 

double-muscling in cattle" ] 
MAMM. GENOME, 

vol. 9, no. 3, March 1998, pages 210-213, 
XP002085799 

see the whole document 


1-95 


P,X 


GROBET ET AL.: "A deletion in the bovine 
myostatin gene causes the double-muscled 
phenotype in cattle." 
NATURE GENETICS 

vol. 17, no. 1, September 1997, pages 
71-74, XP002085800 

& WESTHUSIN, M.: "From mighty mice to 
mighty cows." 
NATURE GENETICS, 

vol. 17, no. 1, September 1997, pages 4-5, 


1-95 


P.X 


MCPHERRON AND LEE* "Double musclina in 

• 1 V«* 1 1 lull 1 \ w 1 * 1 11 1 \J LUL i U wUU t t IIIUJU 1 1 I1U III 

cattle due to mutations in the myostatin 
gene" 

PROC. NATL. ACAD. SCI. USA, 

vol. 94, no. 23, 11 November 1997, pages 

12457-12461, XP002085801 

see the whole document 


1-95 


P,X 


KAMBADUR ET AL.: "Mutations in myostatin 
(GDF8) in double-muscled Belgian Blue and 
; Piedmontese cattle." 
GENOME RESEARCH, 

vol. 7, no. 9, September 1997, pages 
910-916, XP002085802 
see the whole document 

-/- 


1-95 



Form PCT7ISA/210 (continuation of second sheet) (July 1992) 



ATIONAL SEARCH REPORT 




iternational Application No 

PCT/IB 98/01197 



C.(Contlnuatlon) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category • 



P,X 



Citation of document, with indication, where appropriate, ol the relevant passages 



SMITH ET AL.: "Myostatin maps to the 
interval containing the bovine mh locus." 
MAMMALIAN GENOME, 

vol. 8, no. 10, October 1997, pages 
742-744, XP002085803 
see the whole document 

DICKMAN: "Gene mutation provides more 

meat on the hoof." 

SCIENCE, 

vol. 277, no. 5334, 26 September 1997, 
pages 1922-1923, XP002085804 
see the whole document 

WESTHUSIN. M.: "For mighty mice to mighty 

COWS " 

NATURE GENETICS, 

vol. 17, no. 1, September 1997, pages 
71-74, XP002086548 
see the whole document 

WO 98 33887 A (UNIV JOHNS HOPKINS MED) 

6 August 1998 

see the whole document 

GEORGES AND ANDERSSON: "Livestock 
genomics comes of age" 
GENOME RESEARCH, 

vol. 6, 1996, pages 907-921, XP002085805 
cited in the application 
see the whole document 

KAPPES ET AL.: "A second-generation 
linkage map of the bovine genome" 
GENOME RESEARCH, 

vol. 7, 1997, pages 235-249, XP002085806 
cited in the application 
see the whole document 



Relevant to claim No. 



1-95 



1-95 



1-95 



1-95 



1-95 



1-95 



Form PCT/1SA/210 (continuation ol second shoot) (July 1992) 



INTERNATIONAL SEARCH REPORT 



International application No. 

PCT/IB 98/01197 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [T] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: As far as claims 1-4 and 60-66 are directed to a diagnostic method 
practised on the human/animal body, the search ahs been carried out and 
based on the alleged effects of the compound/composition. 

2. Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1.1 1 As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
1 — 1 searchable claims. 



As all searchable claims could be searched without effort justifying an additional fee. this Authority did not invitepayment 
of any additional fee. 



3. | [ As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
— covers only those claims for which fees were paid, specifically claims Nos.: 



No required additional search fees were timely paid by the applicant Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest 



| | The additional search fees were accompanied by the applicant's protest. 
[ | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1))(Jury 1992) 



ATIONAL SEARCH REPORT 



information on patent family members 




iternatlonal Application No 

PCT/IB 98/01197 



Patent document 
cited in search report 


Publication 
date 


Patent family 
members) 


Publication 
date 


WO 9421681 A 


29-09-1994 


CA 


2157577 


A 


29-09-1994 






EP 


0690873 


A 


10-01-1996 






JP 


9507829 


T 


12-08-1997 






US 


5827733 


A 


27-10-1998 



WO 9833887 A 06-08-1998 AU 6274298 A 25-08-1998 



Fwm PCT7ISA/210 (patent (am By annex) {July 1992) 



